Antisense modulation of LIM domain kinase 1 expression

ABSTRACT

Antisense compounds, compositions and methods are provided for modulating the expression of LIM domain kinase 1. The compositions comprise antisense compounds, particularly antisense oligonucleotides, targeted to nucleic acids encoding LIM domain kinase 1. Methods of using these compounds for modulation of LIM domain kinase 1 expression and for treatment of diseases associated with expression of LIM domain kinase 1 are provided.

FIELD OF THE INVENTION

[0001] The present invention provides compositions and methods for modulating the expression of LIM domain kinase 1. In particular, this invention relates to compounds, particularly oligonucleotides, specifically hybridizable with nucleic acids encoding LIM domain kinase 1. Such compounds have been shown to modulate the expression of LIM domain kinase 1.

BACKGROUND OF THE INVENTION

[0002] One of the principal mechanisms by which cellular regulation is effected is through the transduction of extracellular signals across the membrane that in turn modulate biochemical pathways within the cell. Protein phosphorylation represents one course by which intracellular signals are propagated from molecule to molecule resulting finally in a cellular response. These signal transduction cascades are tightly regulated and often overlap as evidenced by the existence of multiple protein kinase and phosphatase families and isoforms.

[0003] Because phosphorylation is such a ubiquitous process within cells and because cellular phenotypes are largely influenced by the activity of these pathways, it is currently believed that a number of disease states and/or disorders are a result of either aberrant activation or functional mutations in the molecular components of these cascades. Consequently, considerable attention has been devoted to the characterization of proteins exhibiting either kinase or phosphatase enzymatic activity.

[0004] LIM domain kinase 1 (also known as LIM motif-containing protein kinase, LIMK and LIMK1) and its relative, LIM domain kinase 2, constitute a novel subclass of serine/threonine kinases with two tandemly arranged LIM domains. The LIM domain is a zinc finger structure that is present in several types of proteins, including homeodomain transcription factors and kinases. Proteins containing LIM domains have been discovered to play important roles in a variety of fundamental biological processes including cytoskeleton organization, cell lineage specification and organ development as well as pathological functions such as oncogenesis (Bach, Mech. Dev., 2000, 91, 5-17).

[0005] LIM domain kinase 1 was cloned and localized to chromosome 7q11.23 (Mizuno et al., Oncogene, 1994, 9, 1605-1612; Okano et al., J. Biol. Chem., 1995, 270, 31321-31330). It is expressed in various human epithelial and hematopoietic cell lines and has been also found to be expressed at high levels in rat brain (Mizuno et al., Oncogene, 1994, 9, 1605-1612). Known splice variants of LIM domain kinase 1 include dLIMK, (Mizuno et al., Oncogene, 1994, 9, 1605-1612) and a variant herein designated LIMK hypothetical-1.

[0006] Nucleic acid sequences encoding LIM domain kinase 1 are disclosed in U.S. Pat. No. 5,858,662 and PCT publication WO 01/94629 (Keating and Morris, 1999; Young et al., 2001).

[0007] LIM domain kinase 1 has been identified as a potent activator of serum response factor, which in turn, regulates transcription of many serum-inducible and muscle-specific genes (Sotiropoulos et al., Cell, 1999, 98, 159-169). Maekawa et al. have demonstrated that LIM domain kinase 1 is phosphorylated and activated by ROCK, a downstream effector of Rho, and that LIM domain kinase 1, in turn, phosphorylates cofilin, inhibiting its actin-depolymerizing activity. It was concluded that this pathway contributes to Rho-induced reorganization of the actin cytoskeleton (Maekawa et al., Science, 1999, 285, 895-898).

[0008] Deletions of the LIM domain kinase 1 gene are associated with Williams syndrome, a neurodevelopmental disorder resulting in specific physical, behavioral and cognitive abnormalities in the developing brain (Donnai and Karmiloff-Smith, Am. J. Med. Genet., 2000, 97, 164-171; Lawler, Curr. Biol., 1999, 9, R800-802; Meng et al., Hum. Genet., 1998, 103, 590-599).

[0009] The involvement of LIM domain kinase 1 in actin-depolymerization and activation of serum response factor indicates that its selective modulation may prove to be a useful strategy for therapeutic intervention in developmental and neurological disorders.

[0010] Higuchi et al. have investigated the effects of negative regulation of LIM domain kinase 1 by transfection of NIH3T3 and H-ras-transformed FYJ10 fibroblasts with antisense LIM domain kinase 1 cDNA and observed stimulated colony formation in the NIH3T3 cells (Higuchi et al., FEBS Lett., 1996, 396, 81-86).

[0011] Disclosed and claimed in PCT publication WO 01/94629 is a process for treating cancer comprising contacting a cancerous cell with an agent having activity against an expression product encoded by a LIM domain kinase 1 gene wherein said agent is an antibody or apoptosis-inducing agent (Young et al., 2001).

[0012] Disclosed and claimed in PCT publication WO 01/30964 is a method for arresting cancer comprising inhibiting a group of cancer-associated molecules, including LIM domain kinase 1, wherein overexpression of the cancer-associated molecule is indicative of cancer and wherein said cancer-associated molecule is inhibited using an antisense polynucleotide or and antibody (Burmer et al., 2001).

[0013] Currently, there are no known therapeutic agents that effectively inhibit the synthesis of LIM domain kinase 1. To date, investigative strategies aimed at modulating LIM domain kinase 1 expression have involved the use of antibodies and antisense cDNA. Consequently, there remains a long felt need for additional agents capable of effectively inhibiting LIM domain kinase 1 function.

[0014] Antisense technology is emerging as an effective means for reducing the expression of specific gene products and may therefore prove to be uniquely useful in a number of therapeutic, diagnostic, and research applications for the modulation of expression of LIM domain kinase 1.

[0015] The present invention provides compositions and methods for modulating expression of LIM domain kinase 1, including modulation of variants of LIM domain kinase 1.

SUMMARY OF THE INVENTION

[0016] The present invention is directed to compounds, particularly antisense oligonucleotides, which are targeted to a nucleic acid encoding LIM domain kinase 1, and which modulate the expression of LIM domain kinase 1. Pharmaceutical and other compositions comprising the compounds of the invention are also provided. Further provided are methods of modulating the expression of LIM domain kinase 1 in cells or tissues comprising contacting said cells or tissues with one or more of the antisense compounds or compositions of the invention. Further provided are methods of treating an animal, particularly a human, suspected of having or being prone to a disease or condition associated with expression of LIM domain kinase 1 by administering a therapeutically or prophylactically effective amount of one or more of the antisense compounds or compositions of the invention.

DETAILED DESCRIPTION OF THE INVENTION

[0017] The present invention employs oligomeric compounds, particularly antisense oligonucleotides, for use in modulating the function of nucleic acid molecules encoding LIM domain kinase 1, ultimately modulating the amount of LIM domain kinase 1 produced. This is accomplished by providing antisense compounds which specifically hybridize with one or more nucleic acids encoding LIM domain kinase 1. As used herein, the terms “target nucleic acid” and “nucleic acid encoding LIM domain kinase 1” encompass DNA encoding LIM domain kinase 1, RNA (including pre-mRNA and mRNA) transcribed from such DNA, and also cDNA derived from such RNA. The specific hybridization of an oligomeric compound with its target nucleic acid interferes with the normal function of the nucleic acid. This modulation of function of a target nucleic acid by compounds which specifically hybridize to it is generally referred to as “antisense”. The functions of DNA to be interfered with include replication and transcription. The functions of RNA to be interfered with include all vital functions such as, for example, translocation of the RNA to the site of protein translation, translocation of the RNA to sites within the cell which are distant from the site of RNA synthesis, translation of protein from the RNA, splicing of the RNA to yield one or more mRNA species, and catalytic activity which may be engaged in or facilitated by the RNA. The overall effect of such interference with target nucleic acid function is modulation of the expression of LIM domain kinase 1 In the context of the present invention, “modulation” means either an increase (stimulation) or a decrease (inhibition) in the expression of a gene. In the context of the present invention, inhibition is the preferred form of modulation of gene expression and mRNA is a preferred target.

[0018] It is preferred to target specific nucleic acids for antisense. “Targeting” an antisense compound to a particular nucleic acid, in the context of this invention, is a multistep process. The process usually begins with the identification of a nucleic acid sequence whose function is to be modulated. This may be, for example, a cellular gene (or mRNA transcribed from the gene) whose expression is associated with a particular disorder or disease state, or a nucleic acid molecule from an infectious agent. In the present invention, the target is a nucleic acid molecule encoding LIM domain kinase 1. The targeting process also includes determination of a site or sites within this gene for the antisense interaction to occur such that the desired effect, e.g., detection or modulation of expression of the protein, will result. Within the context of the present invention, a preferred intragenic site is the region encompassing the translation initiation or termination codon of the open reading frame (ORF) of the gene. Since, as is known in the art, the translation initiation codon is typically 5′-AUG (in transcribed mRNA molecules; 5′-ATG in the corresponding DNA molecule), the translation initiation codon is also referred to as the “AUG codon,” the “start codon” or the “AUG start codon”. A minority of genes have a translation initiation codon having the RNA sequence 5′-GUG, 5′-UUG or 5′-CUG, and 5′-AUA, 5′-ACG and 5′-CUG have been shown to function in vivo. Thus, the terms “translation initiation codon” and “start codon” can encompass many codon sequences, even though the initiator amino acid in each instance is typically methionine in (eukaryotes) or formylmethionine (in prokaryotes). It is also known in the art that eukaryotic and prokaryotic genes may have two or more alternative start codons, any one of which may be preferentially utilized for translation initiation in a particular cell type or tissue, or under a particular set of conditions. In the context of the invention, “start codon” and “translation initiation codon” refer to the codon or codons that are used in vivo to initiate translation of an mRNA molecule transcribed from a gene encoding LIM domain kinase 1, regardless of the sequence(s) of such codons.

[0019] It is also known in the art that a translation termination codon (or “stop codon”) of a gene may have one of three sequences, i.e., 5′-UAA, 5′-UAG and 5′-UGA (the corresponding DNA sequences are 5′-TAA, 5′-TAG and 5′-TGA, respectively). The terms “start codon region” and “translation initiation codon region” refer to a portion of such an mRNA or gene that encompasses from about 25 to about 50 contiguous nucleotides in either direction (i.e., 5′ or 3′) from a translation initiation codon. Similarly, the terms “stop codon region” and “translation termination codon region” refer to a portion of such an mRNA or gene that encompasses from about 25 to about 50 contiguous nucleotides in either direction (i.e., 5′ or 3′) from a translation termination codon.

[0020] The open reading frame (ORF) or “coding region,” which is known in the art to refer to the region between the translation initiation codon and the translation termination codon, is also a region which may be targeted effectively. Other target regions include the 5′ untranslated region (5′UTR), known in the art to refer to the portion of an mRNA in the 5′ direction from the translation initiation codon, and thus including nucleotides between the 5′ cap site and the translation initiation codon of an mRNA or corresponding nucleotides on the gene, and the 3′ untranslated region (3′UTR), known in the art to refer to the portion of an mRNA in the 3′ direction from the translation termination codon, and thus including nucleotides between the translation termination codon and 3′ end of an mRNA or corresponding nucleotides on the gene. The 5′ cap of an mRNA comprises an N7-methylated guanosine residue joined to the 5′-most residue of the mRNA via a 5′-5′ triphosphate linkage. The 5′ cap region of an mRNA is considered to include the 5′ cap structure itself as well as the first 50 nucleotides adjacent to the cap. The 5′ cap region may also be a preferred target region.

[0021] Although some eukaryotic mRNA transcripts are directly translated, many contain one or more regions, known as “introns,” which are excised from a transcript before it is translated. The remaining (and therefore translated) regions are known as “exons” and are spliced together to form a continuous mRNA sequence. mRNA splice sites, i.e., intron-exon junctions, may also be preferred target regions, and are particularly useful in situations where aberrant splicing is implicated in disease, or where an overproduction of a particular mRNA splice product is implicated in disease. Aberrant fusion junctions due to rearrangements or deletions are also preferred targets. mRNA transcripts produced via the process of splicing of two (or more) mRNAs from different gene sources are known as “fusion transcripts”. It has also been found that introns can be effective, and therefore preferred, target regions for antisense compounds targeted, for example, to DNA or pre-mRNA.

[0022] It is also known in the art that alternative RNA transcripts can be produced from the same genomic region of DNA. These alternative transcripts are generally known as “variants”. More specifically, “pre-mRNA variants” are transcripts produced from the same genomic DNA that differ from other transcripts produced from the same genomic DNA in either their start or stop position and contain both intronic and extronic regions.

[0023] Upon excision of one or more exon or intron regions or portions thereof during splicing, pre-mRNA variants produce smaller “mRNA variants”. Consequently, mRNA variants are processed pre-mRNA variants and each unique pre-mRNA variant must always produce a unique mRNA variant as a result of splicing. These mRNA variants are also known as “alternative splice variants”. If no splicing of the pre-mRNA variant occurs then the pre-mRNA variant is identical to the mRNA variant.

[0024] It is also known in the art that variants can be produced through the use of alternative signals to start or stop transcription and that pre-mRNAs and mRNAs can possess more that one start codon or stop codon. Variants that originate from a pre-mRNA or mRNA that use alternative start codons are known as “alternative start variants” of that pre-mRNA or mRNA. Those transcripts that use an alternative stop codon are known as “alternative stop variants” of that pre-mRNA or mRNA. One specific type of alternative stop variant is the “polyA variant” in which the multiple transcripts produced result from the alternative selection of one of the “polyA stop signals” by the transcription machinery, thereby producing transcripts that terminate at unique polyA sites.

[0025] Once one or more target sites have been identified, oligonucleotides are chosen which are sufficiently complementary to the target, i.e., hybridize sufficiently well and with sufficient specificity, to give the desired effect.

[0026] In the context of this invention, “hybridization” means hydrogen bonding, which may be Watson-Crick, Hoogsteen or reversed Hoogsteen hydrogen bonding, between complementary nucleoside or nucleotide bases. For example, adenine and thymine are complementary nucleobases which pair through the formation of hydrogen bonds. “Complementary,” as used herein, refers to the capacity for precise pairing between two nucleotides. For example, if a nucleotide at a certain position of an oligonucleotide is capable of hydrogen bonding with a nucleotide at the same position of a DNA or RNA molecule, then the oligonucleotide and the DNA or RNA are considered to be complementary to each other at that position. The oligonucleotide and the DNA or RNA are complementary to each other when a sufficient number of corresponding positions in each molecule are occupied by nucleotides which can hydrogen bond with each other. Thus, “specifically hybridizable” and “complementary” are terms which are used to indicate a sufficient degree of complementarity or precise pairing such that stable and specific binding occurs between the oligonucleotide and the DNA or RNA target. It is understood in the art that the sequence of an antisense compound need not be 100% complementary to that of its target nucleic acid to be specifically hybridizable.

[0027] An antisense compound is specifically hybridizable when binding of the compound to the target DNA or RNA molecule interferes with the normal function of the target DNA or RNA to cause a loss of activity, and there is a sufficient degree of complementarity to avoid non-specific binding of the antisense compound to non-target sequences under conditions in which specific binding is desired, i.e., under physiological conditions in the case of in vivo assays or therapeutic treatment, and in the case of in vitro assays, under conditions in which the assays are performed. It is preferred that the antisense compounds of the present invention comprise at least 80% sequence complementarity to a target region within the target nucleic acid, moreover that they comprise 90% sequence complementarity and even more comprise 95% sequence complementarity to the target region within the target nucleic acid sequence to which they are targeted. For example, an antisense compound in which 18 of 20 nucleobases of the antisense compound are complementary, and would therefore specifically hybridize, to a target region would represent 90 percent complementarity. Percent complementarity of an antisense compound with a region of a target nucleic acid can be determined routinely using basic local alignment search tools (BLAST programs) (Altschul et al., J. Mol. Biol., 1990, 215, 403-410; Zhang and Madden, Genome Res., 1997, 7, 649-656).

[0028] Antisense and other compounds of the invention, which hybridize to the target and inhibit expression of the target, are identified through experimentation, and representative sequences of these compounds are hereinbelow identified as preferred embodiments of the invention. The sites to which these preferred antisense compounds are specifically hybridizable are hereinbelow referred to as “preferred target regions” and are therefore preferred sites for targeting. As used herein the term “preferred target region” is defined as at least an 8-nucleobase portion of a target region to which an active antisense compound is targeted. While not wishing to be bound by theory, it is presently believed that these target regions represent regions of the target nucleic acid which are accessible for hybridization.

[0029] While the specific sequences of particular preferred target regions are set forth below, one of skill in the art will recognize that these serve to illustrate and describe particular embodiments within the scope of the present invention. Additional preferred target regions may be identified by one having ordinary skill.

[0030] Target regions 8-80 nucleobases in length comprising a stretch of at least eight (8) consecutive nucleobases selected from within the illustrative preferred target regions are considered to be suitable preferred target regions as well.

[0031] Exemplary good preferred target regions include DNA or RNA sequences that comprise at least the 8 consecutive nucleobases from the 5′-terminus of one of the illustrative preferred target regions the remaining nucleobases being a consecutive stretch of the same DNA or RNA beginning immediately upstream of the 5′-terminus of the target region and continuing until the DNA or RNA contains about 8 to about 80 nucleobases). Similarly good preferred target regions are represented by DNA or RNA sequences that comprise at least the 8 consecutive nucleobases from the 3′-terminus of one of the illustrative preferred target regions the remaining nucleobases being a consecutive stretch of the same DNA or RNA beginning immediately downstream of the 3′-terminus of the target region and continuing until the DNA or RNA contains about 8 to about 80 nucleobases). One having skill in the art, once armed with the empirically-derived preferred target regions illustrated herein will be able, without undue experimentation, to identify further preferred target regions. In addition, one having ordinary skill in the art will also be able to identify additional compounds, including oligonucleotide probes and primers, that specifically hybridize to these preferred target regions using techniques available to the ordinary practitioner in the art.

[0032] Antisense compounds are commonly used as research reagents and diagnostics. For example, antisense oligonucleotides, which are able to inhibit gene expression with exquisite specificity, are often used by those of ordinary skill to elucidate the function of particular genes. Antisense compounds are also used, for example, to distinguish between functions of various members of a biological pathway. Antisense modulation has, therefore, been harnessed for research use.

[0033] For use in kits and diagnostics, the antisense compounds of the present invention, either alone or in combination with other antisense compounds or therapeutics, can be used as tools in differential and/or combinatorial analyses to elucidate expression patterns of a portion or the entire complement of genes expressed within cells and tissues.

[0034] Expression patterns within cells or tissues treated with one or more antisense compounds are compared to control cells or tissues not treated with antisense compounds and the patterns produced are analyzed for differential levels of gene expression as they pertain, for example, to disease association, signaling pathway, cellular localization, expression level, size, structure or function of the genes examined. These analyses can be performed on stimulated or unstimulated cells and in the presence or absence of other compounds which affect expression patterns.

[0035] Examples of methods of gene expression analysis known in the art include DNA arrays or microarrays (Brazma and Vilo, FEBS Lett., 2000, 480, 17-24; Celis, et al., FEBS Lett., 2000, 480, 2-16), SAGE (serial analysis of gene expression) (Madden, et al., Drug Discov. Today, 2000, 5, 415-425), READS restriction enzyme amplification of digested cDNAs) (Prashar and Weissman, Methods Enzyymol., 1999, 303, 258-72), TOGA (total gene expression analysis) (Sutcliffe, et al., Proc. Natl. Acad. Sci. U.S.A., 2000, 97, 1976-81), protein arrays and proteomics (Celis, et al., FEBS Lett., 2000, 480, 2-16; Jungblut, et al., Electrophoresis, 1999, 20, 2100-10), expressed sequence tag (EST) sequencing (Celis, et al., FEBS Lett., 2000, 480, 2-16; Larsson, et al., J. Biotechnol., 2000, 80, 143-57), subtractive RNA fingerprinting (SuRF) (Fuchs, et al., Anal. Biochem., 2000, 286, 91-98; Larson, et al., Cytometry, 2000, 41, 203-208), subtractive cloning, differential display (DD) (Jurecic and Belmont, Curr. Opin. Microbiol., 2000, 3, 316-21), comparative genomic hybridization (Carulli, et al., J. Cell Biochem. Suppl., 1998, 31, 286-96), FISH (fluorescent in situ hybridization) techniques (Going and Gusterson, Eur. J. Cancer, 1999, 35, 1895-904) and mass spectrometry methods (reviewed in To, Comb. Chem. High Throughput Screen, 2000, 3, 235-41).

[0036] The specificity and sensitivity of antisense is also harnessed by those of skill in the art for therapeutic uses. Antisense oligonucleotides have been employed as therapeutic moieties in the treatment of disease states in animals and man. Antisense oligonucleotide drugs, including ribozymes, have been safely and effectively administered to humans and numerous clinical trials are presently underway. It is thus established that oligonucleotides can be useful therapeutic modalities that can be configured to be useful in treatment regimes for treatment of cells, tissues and animals, especially humans.

[0037] In the context of this invention, the term “oligonucleotide” refers to an oligomer or polymer of ribonucleic acid (RNA) or deoxyribonucleic acid DNA) or mimetics thereof. This term includes oligonucleotides composed of naturally-occurring nucleobases, sugars and covalent internucleoside (backbone) linkages as well as oligonucleotides having non-naturally-occurring portions which function similarly. Such modified or substituted oligonucleotides are often preferred over native forms because of desirable properties such as, for example, enhanced cellular uptake, enhanced affinity for nucleic acid target and increased stability in the presence of nucleases.

[0038] While antisense oligonucleotides are a preferred form of antisense compound, the present invention comprehends other oligomeric antisense compounds, including but not limited to oligonucleotide mimetics such as are described below. The antisense compounds in accordance with this invention preferably comprise from about 8 to about 80 nucleobases (i.e. from about 8 to about 80 linked nucleosides). Particularly preferred antisense compounds are antisense oligonucleotides from about 8 to about 50 nucleobases, even more preferably those comprising from about 12 to about 30 nucleobases. Antisense compounds include ribozymes, external guide sequence (EGS) oligonucleotides (oligozymes), and other short catalytic RNAs or catalytic oligonucleotides which hybridize to the target nucleic acid and modulate its expression.

[0039] Antisense compounds 8-80 nucleobases in length comprising a stretch of at least eight (8) consecutive nucleobases selected from within the illustrative antisense compounds are considered to be suitable antisense compounds as well.

[0040] Exemplary preferred antisense compounds include DNA or RNA sequences that comprise at least the 8 consecutive nucleobases from the 5′-terminus of one of the illustrative preferred antisense compounds (the remaining nucleobases being a consecutive stretch of the same DNA or RNA beginning immediately upstream of the 5′-terminus of the antisense compound which is specifically hybridizable to the target nucleic acid and continuing until the DNA or RNA contains about 8 to about 80 nucleobases). Similarly preferred antisense compounds are represented by DNA or RNA sequences that comprise at least the 8 consecutive nucleobases from the 3′-terminus of one of the illustrative preferred antisense compounds (the remaining nucleobases being a consecutive stretch of the same DNA or RNA beginning immediately downstream of the 3′-terminus of the antisense compound which is specifically hybridizable to the target nucleic acid and continuing until the DNA or RNA contains about 8 to about 80 nucleobases). One having skill in the art, once armed with the empirically-derived preferred antisense compounds illustrated herein will be able, without undue experimentation, to identify further preferred antisense compounds.

[0041] Antisense and other compounds of the invention, which hybridize to the target and inhibit expression of the target, are identified through experimentation, and representative sequences of these compounds are herein identified as preferred embodiments of the invention. While specific sequences of the antisense compounds are set forth herein, one of skill in the art will recognize that these serve to illustrate and describe particular embodiments within the scope of the present invention. Additional preferred antisense compounds may be identified by one having ordinary skill.

[0042] As is known in the art, a nucleoside is a base-sugar combination. The base portion of the nucleoside is normally a heterocyclic base. The two most common classes of such heterocyclic bases are the purines and the pyrimidines. Nucleotides are nucleosides that further include a phosphate group covalently linked to the sugar portion of the nucleoside. For those nucleosides that include a pentofuranosyl sugar, the phosphate group can be linked to either the 2′, 3′ or 5′ hydroxyl moiety of the sugar. In forming oligonucleotides, the phosphate groups covalently link adjacent nucleosides to one another to form a linear polymeric compound. In turn, the respective ends of this linear polymeric structure can be further joined to form a circular structure, however, open linear structures are generally preferred. In addition, linear structures may also have internal nucleobase complementarity and may therefore fold in a manner as to produce a double stranded structure. Within the oligonucleotide structure, the phosphate groups are commonly referred to as forming the internucleoside backbone of the oligonucleotide. The normal linkage or backbone of RNA and DNA is a 3′ to 5′ phosphodiester linkage.

[0043] Specific examples of preferred antisense compounds useful in this invention include oligonucleotides containing modified backbones or non-natural internucleoside linkages. As defined in this specification, oligonucleotides having modified backbones include those that retain a phosphorus atom in the backbone and those that do not have a phosphorus atom in the backbone. For the purposes of this specification, and as sometimes referenced in the art, modified oligonucleotides that do not have a phosphorus atom in their internucleoside backbone can also be considered to be oligonucleosides.

[0044] Preferred modified oligonucleotide backbones include, for example, phosphorothioates, chiral phosphorothioates, phosphorodithioates, phosphotriesters, aminoalkylphosphotri-esters, methyl and other alkyl phosphonates including 3′-alkylene phosphonates, 5′-alkylene phosphonates and chiral phosphonates, phosphinates, phosphoramidates including 3′-amino phosphoramidate and aminoalkylphosphoramidates, thionophosphoramidates, thionoalkylphosphonates, thionoalkylphosphotriesters, selenophosphates and borano-phosphates having normal 3′-5′ linkages, 2′-5′ linked analogs of these, and those having inverted polarity wherein one or more internucleotide linkages is a 3′ to 3′, 5′ to 5′ or 2′ to 2′ linkage. Preferred oligonucleotides having inverted polarity comprise a single 3′ to 3′ linkage at the 3′-most internucleotide linkage i.e. a single inverted nucleoside residue which may be abasic (the nucleobase is missing or has a hydroxyl group in place thereof). Various salts, mixed salts and free acid forms are also included.

[0045] Representative United States patents that teach the preparation of the above phosphorus-containing linkages include, but are not limited to, U.S. Pat. Nos. 3,687,808; 4,469,863; 4,476,301; 5,023,243; 5,177,196; 5,188,897; 5,264,423; 5,276,019; 5,278,302; 5,286,717; 5,321,131; 5,399,676; 5,405,939; 5,453,496; 5,455,233; 5,466,677; 5,476,925; 5,519,126; 5,536,821; 5,541,306; 5,550,111; 5,563,253; 5,571,799; 5,587,361; 5,194,599; 5,565,555; 5,527,899; 5,721,218; 5,672,697 and 5,625,050, certain of which are commonly owned with this application, and each of which is herein incorporated by reference.

[0046] Preferred modified oligonucleotide backbones that do not include a phosphorus atom therein have backbones that are formed by short chain alkyl or cycloalkyl internucleoside linkages, mixed heteroatom and alkyl or cycloalkyl internucleoside linkages, or one or more short chain heteroatomic or heterocyclic internucleoside linkages. These include those having morpholino linkages (formed in part from the sugar portion of a nucleoside); siloxane backbones; sulfide, sulfoxide and sulfone backbones; formacetyl and thioformacetyl backbones; methylene formacetyl and thioformacetyl backbones; riboacetyl backbones; alkene containing backbones; sulfamate backbones; methyleneimino and methylenehydrazino backbones; sulfonate and sulfonamide backbones; amide backbones; and others having mixed N, O, S and CH₂ component parts.

[0047] Representative United States patents that teach the preparation of the above oligonucleosides include, but are not limited to, U.S. Pat. Nos. 5,034,506; 5,166,315; 5,185,444; 5,214,134; 5,216,141; 5,235,033; 5,264,562; 5,264,564; 5,405,938; 5,434,257; 5,466,677; 5,470,967; 5,489,677; 5,541,307; 5,561,225; 5,596,086; 5,602,240; 5,610,289; 5,602,240; 5,608,046; 5,610,289; 5,618,704; 5,623,070; 5,663,312; 5,633,360; 5,677,437; 5,792,608; 5,646,269 and 5,677,439, certain of which are commonly owned with this application, and each of which is herein incorporated by reference.

[0048] In other preferred oligonucleotide mimetics, both the sugar and the internucleoside linkage, i.e., the backbone, of the nucleotide units are replaced with novel groups. The base units are maintained for hybridization with an appropriate nucleic acid target compound. One such oligomeric compound, an oligonucleotide mimetic that has been shown to have excellent hybridization properties, is referred to as a peptide nucleic acid (PNA). In PNA compounds, the sugar-backbone of an oligonucleotide is replaced with an amide containing backbone, in particular an aminoethylglycine backbone. The nucleobases are retained and are bound directly or indirectly to aza nitrogen atoms of the amide portion of the backbone. Representative United States patents that teach the preparation of PNA compounds include, but are not limited to, U.S. Pat. Nos. 5,539,082; 5,714,331; and 5,719,262, each of which is herein incorporated by reference. Further teaching of PNA compounds can be found in Nielsen et al., Science, 1991, 254, 1497-1500.

[0049] Most preferred embodiments of the invention are oligonucleotides with phosphorothioate backbones and oligonucleosides with heteroatom backbones, and in particular —CH₂—NH—O—CH₂—, —CH₂—N(CH₃)—O—CH₂— [known as a methylene (methylimino) or MMI backbone], —CH₂—O—N(CH₃)—CH₂—, —CH₂—N(CH₃)—N(CH₃)—CH₂— and —O—N(CH₃)—CH₂—CH₂— [wherein the native phosphodiester backbone is represented as —O—P—O—CH₂—] of the above referenced U.S. Pat. No. 5,489,677, and the amide backbones of the above referenced U.S. Pat. No. 5,602,240. Also preferred are oligonucleotides having morpholino backbone structures of the above-referenced U.S. Pat. No. 5,034,506.

[0050] Modified oligonucleotides may also contain one or more substituted sugar moieties. Preferred oligonucleotides comprise one of the following at the 2′ position: OH; F; O-, S-, or N-alkyl; O-, S-, or N-alkenyl; O-, S- or N-alkynyl; or O-alkyl-O-alkyl, wherein the alkyl, alkenyl and alkynyl may be substituted or unsubstituted C₁ to C₁₀ alkyl or C₂ to C₁₀ alkenyl and alkynyl. Particularly preferred are O[(CH₂)_(n)O]_(m)CH₃, O(CH₂)_(n)OCH₃, O(CH₂)_(n)NH₂, O(CH₂)_(n)CH₃, O(CH₂)_(n)ONH₂, and O(CH₂)_(n)ON[(CH₂)_(n)CH₃]₂, where n and m are from 1 to about 10. Other preferred oligonucleotides comprise one of the following at the 2′ position: C₁to C₁₀ lower alkyl, substituted lower alkyl, alkenyl, alkynyl, alkaryl, aralkyl, O-alkaryl or O-aralkyl, SH, SCH₃, OCN, Cl, Br, CN, CF₃, OCF₃, SOCH₃ , SO₂CH₃, ONO₂, NO₂, N₃, NH₂, heterocycloalkyl, heterocycloalkaryl, aminoalkylamino, polyalkylamino, substituted silyl, an RNA cleaving group, a reporter group, an intercalator, a group for improving the pharmacokinetic properties of an oligonucleotide, or a group for improving the pharmacodynamic properties of an oligonucleotide, and other substituents having similar properties. A preferred modification includes 2′-methoxyethoxy (2′—O—CH₂CH₂OCH₃, also known as 2′—O—(2-methoxyethyl) or 2′-MOE) (Martin et al., Helv. Chim. Acta, 1995, 78, 486-504) i.e., an alkoxyalkoxy group. A further preferred modification includes 2′-dimethylaminooxyethoxy, i.e., a O(CH₂)₂ON(CH₃)₂ group, also known as 2′-DMAOE, as described in examples hereinbelow, and 2′-dimethylaminoethoxyethoxy (also known in the art as 2′-O-dimethyl-amino-ethoxy-ethyl or 2′-DMAEOE), i.e., 2′—O—CH₂—O—CH₂—N(CH₃)₂, also described in examples hereinbelow.

[0051] Other preferred modifications include 2′-methoxy (2′—O—CH₃), 2′-aminopropoxy (2′—OCH₂CH₂CH₂NH₂), 2′-allyl (2′—CH₂—CH═CH₂), 2′-O-allyl (2′—O—CH₂—CH═CH₂) and 2′-fluoro (2′-F). The 2′-modification may be in the arabino (up) position or ribo (down) position. A preferred 2′-arabino modification is 2′-F. Similar modifications may also be made at other positions on the oligonucleotide, particularly the 3′ position of the sugar on the 3′ terminal nucleotide or in 2′-5′ linked oligonucleotides and the 5′ position of 5′ terminal nucleotide. Oligonucleotides may also have sugar mimetics such as cyclobutyl moieties in place of the pentofuranosyl sugar. Representative United States patents that teach the preparation of such modified sugar structures include, but are not limited to, U.S. Pat. Nos. 4,981,957; 5,118,800; 5,319,080; 5,359,044; 5,393,878; 5,446,137; 5,466,786; 5,514,785; 5,519,134; 5,567,811; 5,576,427; 5,591,722; 5,597,909; 5,610,300; 5,627,053; 5,639,873; 5,646,265; 5,658,873; 5,670,633; 5,792,747; and 5,700,920, certain of which are commonly owned with the instant application, and each of which is herein incorporated by reference in its entirety.

[0052] A further preferred modification includes Locked Nucleic Acids (LNAs) in which the 2′-hydroxyl group is linked to the 3′ or 4′ carbon atom of the sugar ring thereby forming a bicyclic sugar moiety. The linkage is preferably a methelyne (—CH₂—)_(n) group bridging the 2′ oxygen atom and the 4′ carbon atom wherein n is 1 or 2. LNAs and preparation thereof are described in WO 98/39352 and WO 99/14226.

[0053] Oligonucleotides may also include nucleobase (often referred to in the art simply as “base”) modifications or substitutions. As used herein, “unmodified” or “natural” nucleobases include the purine bases adenine (A) and guanine (G), and the pyrimidine bases thymine (T), cytosine (C) and uracil (U). Modified nucleobases include other synthetic and natural nucleobases such as 5-methylcytosine (5-me-C), 5-hydroxymethyl cytosine, xanthine, hypoxanthine, 2-aminoadenine, 6-methyl and other alkyl derivatives of adenine and guanine, 2-propyl and other alkyl derivatives of adenine and guanine, 2-thiouracil, 2-thiothymine and 2-thiocytosine, 5-halouracil and cytosine, 5-propynyl (—C≡C—CH₃) uracil and cytosine and other alkynyl derivatives of pyrimidine bases, 6-azo uracil, cytosine and thymine, 5-uracil (pseudouracil), 4-thiouracil, 8-halo, 8-amino, 8-thiol, 8-thioalkyl, 8-hydroxyl and other 8-substituted adenines and guanines, 5-halo particularly 5-bromo, 5-trifluoromethyl and other 5-substituted uracils and cytosines, 7-methylguanine and 7-methyladenine, 2-F-adenine, 2-amino-adenine, 8-azaguanine and 8-azaadenine, 7-deazaguanine and 7-deazaadenine and 3-deazaguanine and 3-deazaadenine. Further modified nucleobases include tricyclic pyrimidines such as phenoxazine cytidine(1H-pyrimido[5,4-b][1,4]benzoxazin-2(3H)-one), phenothiazine cytidine (1H-pyrimido[5,4-b][1,4]benzothiazin-2(3H)-one), G-clamps such as a substituted phenoxazine cytidine e.g. 9-(2-aminoethoxy)-H-pyrimido[5,4-b][1,4]benzoxazin-2(3H)-one), carbazole cytidine (2H-pyrimido)[4,5-b]indol-2-one), pyridoindole cytidine (H-pyrido)[3′,2′:4,5]pyrrolo[2,3-d]pyrimidin-2-one). Modified nucleobases may also include those in which the purine or pyrimidine base is replaced with other heterocycles, for example 7-deaza-adenine, 7-deazaguanosine, 2-aminopyridine and 2-pyridone. Further nucleobases include those disclosed in U.S. Pat. No. 3,687,808, those disclosed in The Concise Encyclopedia Of Polymer Science And Engineering, pages 858-859, Kroschwitz, J. I., ed. John Wiley & Sons, 1990, those disclosed by Englisch et al., Angewandte Chemie, International Edition, 1991, 30, 613, and those disclosed by Sanghvi, Y. S., Chapter 15, Antisense Research and Applications, pages 289-302, Crooke, S. T. and Lebleu, B., ed., CRC Press, 1993. Certain of these nucleobases are particularly useful for increasing the binding affinity of the oligomeric compounds of the invention. These include 5-substituted pyrimidines, 6-azapyrimidines and N-2, N-6 and O-6 substituted purines, including 2-aminopropyladenine, 5-propynyluracil and 5-propynylcytosine. 5-methylcytosine substitutions have been shown to increase nucleic acid duplex stability by 0.6-1.2° C. (Sanghvi, Y. S., Crooke, S. T. and Lebleu, B., eds., Antisense Research and Applications, CRC Press, Boca Raton, 1993, pp. 276-278) and are presently preferred base substitutions, even more particularly when combined with 2′-O-methoxyethyl sugar modifications.

[0054] Representative United States patents that teach the preparation of certain of the above noted modified nucleobases as well as other modified nucleobases include, but are not limited to, the above noted U.S. Pat. No. 3,687,808, as well as U.S. Pat. Nos. 4,845,205; 5,130,302; 5,134,066; 5,175,273; 5,367,066; 5,432,272; 5,457,187; 5,459,255; 5,484,908; 5,502,177; 5,525,711; 5,552,540; 5,587,469; 5,594,121, 5,596,091; 5,614,617; 5,645,985; 5,830,653; 5,763,588; 6,005,096; and 5,681,941, certain of which are commonly owned with the instant application, and each of which is herein incorporated by reference, and U.S. Pat. No. 5,750,692, which is commonly owned with the instant application and also herein incorporated by reference.

[0055] Another modification of the oligonucleotides of the invention involves chemically linking to the oligonucleotide one or more moieties or conjugates which enhance the activity, cellular distribution or cellular uptake of the oligonucleotide. The compounds of the invention can include conjugate groups covalently bound to functional groups such as primary or secondary hydroxyl groups. Conjugate groups of the invention include intercalators, reporter molecules, polyamines, polyamides, polyethylene glycols, polyethers, groups that enhance the pharmacodynamic properties of oligomers, and groups that enhance the pharmacokinetic properties of oligomers. Typical conjugate groups include cholesterols, lipids, phospholipids, biotin, phenazine, folate, phenanthridine, anthraquinone, acridine, fluores-ceins, rhodamines, coumarins, and dyes. Groups that enhance the pharmacodynamic properties, in the context of this invention, include groups that improve oligomer uptake, enhance oligomer resistance to degradation, and/or strengthen sequence-specific hybridization with RNA. Groups that enhance the pharmacokinetic properties, in the context of this invention, include groups that improve oligomer uptake, distribution, metabolism or excretion. Representative conjugate groups are disclosed in International Patent Application PCT/US92/09196, filed Oct. 23, 1992 the entire disclosure of which is incorporated herein by reference. Conjugate moieties include but are not limited to lipid moieties such as a cholesterol moiety (Letsinger et al., Proc. Natl. Acad. Sci. USA, 1989, 86, 6553-6556), cholic acid (Manoharan et al., Bioorg. Med. Chem. Let., 1994, 4, 1053-1060), a thioether, e.g., hexyl-S-tritylthiol (Manoharan et al., Ann. N.Y. Acad. Sci., 1992, 660, 306-309; Manoharan et al., Bioorg. Med. Chem. Let., 1993, 3, 2765-2770), a thiocholesterol (Oberhauser et al., Nucl. Acids Res., 1992, 20, 533-538), an aliphatic chain, e.g., dodecandiol or undecyl residues (Saison-Behmoaras et al., EMBO J., 1991, 10, 1111-1118; Kabanov et al., FEBS Lett., 1990, 259, 327-330; Svinarchuk et al., Biochimie, 1993, 75, 49-54), a phospholipid, e.g., di-hexadecyl-rac-glycerol or triethyl-ammonium 1,2-di-O-hexadecyl-rac-glycero-3-H-phosphonate (Manoharan et al., Tetrahedron Lett., 1995, 36, 3651-3654; Shea et al., Nucl. Acids Res., 1990, 18, 3777-3783), a polyamine or a polyethylene glycol chain (Manoharan et al., Nucleosides & Nucleotides, 1995, 14, 969-973), or adamantane acetic acid (Manoharan et al., Tetrahedron Lett., 1995, 36, 3651-3654), a palmityl moiety (Mishra et al., Biochim. Biophys. Acta, 1995, 1264, 229-237), or an octadecylamine or hexylamino-carbonyl-oxycholesterol moiety (Crooke et al., J. Pharmacol. Exp. Ther., 1996, 277, 923-937). Oligonucleotides of the invention may also be conjugated to active drug substances, for example, aspirin, warfarin, phenylbutazone, ibuprofen, suprofen, fenbufen, ketoprofen, (S)-(+)-pranoprofen, carprofen, dansylsarcosine, 2,3,5-triiodobenzoic acid, flufenamic acid, folinic acid, a benzothiadiazide, chlorothiazide, a diazepine, indomethicin, a barbiturate, a cephalosporin, a sulfa drug, an antidiabetic, an antibacterial or an antibiotic. Oligonucleotide-drug conjugates and their preparation are described in U.S. pat. application Ser. No. 09/334,130 (filed Jun. 15, 1999) which is incorporated herein by reference in its entirety.

[0056] Representative United States pattents that teach the preparation of such oligonucleotide conjugates include, but are not limited to, U.S. Pat. Nos. 4,828,979; 4,948,882; 5,218,105; 5,525,465; 5,541,313; 5,545,730; 5,552,538; 5,578,717, 5,580,731; 5,580,731; 5,591,584; 5,109,124; 5,118,802; 5,138,045; 5,414,077; 5,486,603; 5,512,439; 5,578,718; 5,608,046; 4,587,044; 4,605,735; 4,667,025; 4,762,779; 4,789,737; 4,824,941; 4,835,263; 4,876,335; 4,904,582; 4,958,013; 5,082,830; 5,112,963; 5,214,136; 5,082,830; 5,112,963; 5,214,136; 5,245,022; 5,254,469; 5,258,506; 5,262,536; 5,272,250; 5,292,873; 5,317,098; 5,371,241, 5,391,723; 5,416,203, 5,451,463; 5,510,475; 5,512,667; 5,514,785; 5,565,552; 5,567,810; 5,574,142; 5,585,481; 5,587,371; 5,595,726; 5,597,696; 5,599,923; 5,599,928 and 5,688,941, certain of which are commonly owned with the instant application, and each of which is herein incorporated by reference.

[0057] It is not necessary for all positions in a given compound to be uniformly modified, and in fact more than one of the aforementioned modifications may be incorporated in a single compound or even at a single nucleoside within an oligonucleotide. The present invention also includes antisense compounds which are chimeric compounds. “Chimeric” antisense compounds or “chimeras,” in the context of this invention, are antisense compounds, particularly oligonucleotides, which contain two or more chemically distinct regions, each made up of at least one monomer unit, i.e., a nucleotide in the case of an oligonucleotide compound. These oligonucleotides typically contain at least one region wherein the oligonucleotide is modified so as to confer upon the oligonucleotide increased resistance to nuclease degradation, increased cellular uptake, increased stability and/or increased binding affinity for the target nucleic acid. An additional region of the oligonucleotide may serve as a substrate for enzymes capable of cleaving RNA:DNA or RNA:RNA hybrids. By way of example, RNAse H is a cellular endonuclease which cleaves the RNA strand of an RNA:DNA duplex. Activation of RNase H, therefore, results in cleavage of the RNA target, thereby greatly enhancing the efficiency of oligonucleotide inhibition of gene expression. The cleavage of RNA:RNA hybrids can, in like fashion, be accomplished through the actions of endoribonucleases, such as interferon-induced RNAseL which cleaves both cellular and viral RNA. Consequently, comparable results can often be obtained with shorter oligonucleotides when chimeric oligonucleotides are used, compared to phosphorothioate deoxyoligonucleotides hybridizing to the same target region. Cleavage of the RNA target can be routinely detected by gel electrophoresis and, if necessary, associated nucleic acid hybridization techniques known in the art.

[0058] Chimeric antisense compounds of the invention may be formed as composite structures of two or more oligonucleotides, modified oligonucleotides, oligonucleosides and/or oligonucleotide mimetics as described above. Such compounds have also been referred to in the art as hybrids or gapmers. Representative United States patents that teach the preparation of such hybrid structures include, but are not limited to, U.S. Pat. Nos. 5,013,830; 5,149,797; 5,220,007; 5,256,775; 5,366,878; 5,403,711; 5,491,133; 5,565,350; 5,623,065; 5,652,355; 5,652,356; and 5,700,922, certain of which are commonly owned with the instant application, and each of which is herein incorporated by reference in its entirety.

[0059] The antisense compounds used in accordance with this invention may be conveniently and routinely made through the well-known technique of solid phase synthesis. Equipment for such synthesis is sold by several vendors including, for example, Applied Biosystems (Foster City, Calif.). Any other means for such synthesis known in the art may additionally or alternatively be employed. It is well known to use similar techniques to prepare oligonucleotides such as the phosphorothioates and alkylated derivatives.

[0060] The compounds of the invention may also be admixed, encapsulated, conjugated or otherwise associated with other molecules, molecule structures or mixtures of compounds, as for example, liposomes, receptor-targeted molecules, oral, rectal, topical or other formulations, for assisting in uptake, distribution and/or absorption. Representative United States patents that teach the preparation of such uptake, distribution and/or absorption-assisting formulations include, but are not limited to, U.S. Pat. Nos. 5,108,921; 5,354,844; 5,416,016; 5,459,127; 5,521,291; 5,543,158; 5,547,932; 5,583,020; 5,591,721; 4,426,330; 4,534,899; 5,013,556; 5,108,921; 5,213,804; 5,227,170; 5,264,221; 5,356,633; 5,395,619; 5,416,016; 5,417,978; 5,462,854; 5,469,854; 5,512,295; 5,527,528; 5,534,259; 5,543,152; 5,556,948; 5,580,575; and 5,595,756, each of which is herein incorporated by reference.

[0061] The antisense compounds of the invention encompass any pharmaceutically acceptable salts, esters, or salts of such esters, or any other compound which, upon administration to an animal, including a human, is capable of providing (directly or indirectly) the biologically active metabolite or residue thereof. Accordingly, for example, the disclosure is also drawn to prodrugs and pharmaceutically acceptable salts of the compounds of the invention, pharmaceutically acceptable salts of such prodrugs, and other bioequivalents.

[0062] The term “prodrug” indicates a therapeutic agent that is prepared in an inactive form that is converted to an active form (i.e., drug) within the body or cells thereof by the action of endogenous enzymes or other chemicals and/or conditions. In particular, prodrug versions of the oligonucleotides of the invention are prepared as SATE [(S-acetyl-2-thioethyl) phosphate] derivatives according to the methods disclosed in WO 93/24510 to Gosselin et al., published Dec. 9, 1993 or in WO 94/26764 and U.S. Pat. No. 5,770,713 to Imbach et al.

[0063] The term “pharmaceutically acceptable salts” refers to physiologically and pharmaceutically acceptable salts of the compounds of the invention: i.e., salts that retain the desired biological activity of the parent compound and do not impart undesired toxicological effects thereto.

[0064] Pharmaceutically acceptable base addition salts are formed with metals or amines, such as alkali and alkaline earth metals or organic amines. Examples of metals used as cations are sodium, potassium, magnesium, calcium, and the like. Examples of suitable amines are N,N′-dibenzylethylenediamine, chloroprocaine, choline, diethanolamine, dicyclohexylamine, ethylenediamine, N-methylglucamine, and procaine (see, for example, Berge et al., “Pharmaceutical Salts,” J. of Pharma Sci., 1977, 66, 1-19). The base addition salts of said acidic compounds are prepared by contacting the free acid form with a sufficient amount of the desired base to produce the salt in the conventional manner. The free acid form may be regenerated by contacting the salt form with an acid and isolating the free acid in the conventional manner. The free acid forms differ from their respective salt forms somewhat in certain physical properties such as solubility in polar solvents, but otherwise the salts are equivalent to their respective free acid for purposes of the present invention. As used herein, a “pharmaceutical addition salt” includes a pharmaceutically acceptable salt of an acid form of one of the components of the compositions of the invention. These include organic or inorganic acid salts of the amines. Preferred acid salts are the hydrochlorides, acetates, salicylates, nitrates and phosphates. Other suitable pharmaceutically acceptable salts are well known to those skilled in the art and include basic salts of a variety of inorganic and organic acids, such as, for example, with inorganic acids, such as for example hydrochloric acid, hydrobromic acid, sulfuric acid or phosphoric acid; with organic carboxylic, sulfonic, sulfo or phospho acids or N-substituted sulfamic acids, for example acetic acid, propionic acid, glycolic acid, succinic acid, maleic acid, hydroxymaleic acid, methylmaleic acid, fumaric acid, malic acid, tartaric acid, lactic acid, oxalic acid, gluconic acid, glucaric acid, glucuronic acid, citric acid, benzoic acid, cinnamic acid, mandelic acid, salicylic acid, 4-aminosalicylic acid, 2-phenoxybenzoic acid, 2-acetoxybenzoic acid, embonic acid, nicotinic acid or isonicotinic acid; and with amino acids, such as the 20 alpha-amino acids involved in the synthesis of proteins in nature, for example glutamic acid or aspartic acid, and also with phenylacetic acid, methanesulfonic acid, ethanesulfonic acid, 2-hydroxyethanesulfonic acid, ethane-1,2-disulfonic acid, benzenesulfonic acid, 4-methylbenzenesulfonic acid, naphthalene-2-sulfonic acid, naphthalene-1,5-disulfonic acid, 2- or 3-phosphoglycerate, glucose-6-phosphate, N-cyclohexylsulfamic acid (with the formation of cyclamates), or with other acid organic compounds, such as ascorbic acid. Pharmaceutically acceptable salts of compounds may also be prepared with a pharmaceutically acceptable cation. Suitable pharmaceutically acceptable cations are well known to those skilled in the art and include alkaline, alkaline earth, ammonium and quaternary ammonium cations. Carbonates or hydrogen carbonates are also possible.

[0065] For oligonucleotides, preferred examples of pharmaceutically acceptable salts include but are not limited to (a) salts formed with cations such as sodium, potassium, ammonium, magnesium, calcium, polyamines such as spermine and spermidine, etc.; (b) acid addition salts formed with inorganic acids, for example hydrochloric acid, hydrobromic acid, sulfuric acid, phosphoric acid, nitric acid and the like; (c) salts formed with organic acids such as, for example, acetic acid, oxalic acid, tartaric acid, succinic acid, maleic acid, fumaric acid, gluconic acid, citric acid, malic acid, ascorbic acid, benzoic acid, tannic acid, palmitic acid, alginic acid, polyglutamic acid, naphthalenesulfonic acid, methanesulfonic acid, p-toluenesulfonic acid, naphthalenedisulfonic acid, polygalacturonic acid, and the like; and (d) salts formed from elemental anions such as chlorine, bromine, and iodine.

[0066] The antisense compounds of the present invention can be utilized for diagnostics, therapeutics, prophylaxis and as research reagents and kits. For therapeutics, an animal, preferably a human, suspected of having a disease or disorder which can be treated by modulating the expression of LIM domain kinase 1 is treated by administering antisense compounds in accordance with this invention. The compounds of the invention can be utilized in pharmaceutical compositions by adding an effective amount of an antisense compound to a suitable pharmaceutically acceptable diluent or carrier. Use of the antisense compounds and methods of the invention may also be useful prophylactically, e.g., to prevent or delay infection, inflammation or tumor formation, for example.

[0067] The antisense compounds of the invention are useful for research and diagnostics, because these compounds hybridize to nucleic acids encoding LIM domain kinase 1, enabling sandwich and other assays to easily be constructed to exploit this fact. Hybridization of the antisense oligonucleotides of the invention with a nucleic acid encoding LIM domain kinase 1 can be detected by means known in the art. Such means may include conjugation of an enzyme to the oligonucleotide, radiolabelling of the oligonucleotide or any other suitable detection means. Kits using such detection means for detecting the level of LIM domain kinase 1 in a sample may also be prepared.

[0068] The present invention also includes pharmaceutical compositions and formulations which include the antisense compounds of the invention. The pharmaceutical compositions of the present invention may be administered in a number of ways depending upon whether local or systemic treatment is desired and upon the area to be treated. Administration may be topical (including ophthalmic and to mucous membranes including vaginal and rectal delivery), pulmonary, e.g., by inhalation or insufflation of powders or aerosols, including by nebulizer; intratracheal, intranasal, epidermal and transdermal), oral or parenteral. Parenteral administration includes intravenous, intraarterial, subcutaneous, intraperitoneal or intramuscular injection or infusion; or intracranial, e.g., intrathecal or intraventricular, administration. Oligonucleotides with at least one 2′-O-methoxyethyl modification are believed to be particularly useful for oral administration.

[0069] Pharmaceutical compositions and formulations for topical administration may include transdermal patches, ointments, lotions, creams, gels, drops, suppositories, sprays, liquids and powders. Conventional pharmaceutical carriers, aqueous, powder or oily bases, thickeners and the like may be necessary or desirable. Coated condoms, gloves and the like may also be useful. Preferred topical formulations include those in which the oligonucleotides of the invention are in admixture with a topical delivery agent such as lipids, liposomes, fatty acids, fatty acid esters, steroids, chelating agents and surfactants. Preferred lipids and liposomes include neutral (e.g. dioleoylphosphatidyl DOPE ethanolamine, dimyristoylphosphatidyl choline DMPC, distearolyphosphatidyl choline) negative (e.g. dimyristoylphosphatidyl glycerol DMPG) and cationic (e.g. dioleoyltetramethylaminopropyl DOTAP and dioleoylphosphatidyl ethanolamine DOTMA). Oligonucleotides of the invention may be encapsulated within liposomes or may form complexes thereto, in particular to cationic liposomes. Alternatively, oligonucleotides may be complexed to lipids, in particular to cationic lipids. Preferred fatty acids and esters include but are not limited arachidonic acid, oleic acid, eicosanoic acid, lauric acid, caprylic acid, capric acid, myristic acid, palmitic acid, stearic acid, linoleic acid, linolenic acid, dicaprate, tricaprate, monoolein, dilaurin, glyceryl 1-monocaprate, 1-dodecylazacycloheptan-2-one, an acylcarnitine, an acylcholine, or a C₁₋₁₀ alkyl ester (e.g. isopropylmyristate IPM), monoglyceride, diglyceride or pharmaceutically acceptable salt thereof. Topical formulations are described in detail in U.S. patent application Ser. No. 09/315,298 filed on May 20, 1999 which is incorporated herein by reference in its entirety.

[0070] Compositions and formulations for oral administration include powders or granules, microparticulates, nanoparticulates, suspensions or solutions in water or non-aqueous media, capsules, gel capsules, sachets, tablets or minitablets. Thickeners, flavoring agents, diluents, emulsifiers, dispersing aids or binders may be desirable. Preferred oral formulations are those in which oligonucleotides of the invention are administered in conjunction with one or more penetration enhancers surfactants and chelators. Preferred surfactants include fatty acids and/or esters or salts thereof, bile acids and/or salts thereof. Preferred bile acids/salts include chenodeoxycholic acid (CDCA) and ursodeoxychenodeoxycholic acid UDCA), cholic acid, dehydrocholic acid, deoxycholic acid, glucholic acid, glycholic acid, glycodeoxycholic acid, taurocholic acid, taurodeoxycholic acid, sodium tauro-24,25-dihydro-fusidate and sodium glycodihydrofusidate. Preferred fatty acids include arachidonic acid, undecanoic acid, oleic acid, lauric acid, caprylic acid, capric acid, myristic acid, palmitic acid, stearic acid, linoleic acid, linolenic acid, dicaprate, tricaprate, monoolein, dilaurin, glyceryl 1-monocaprate, 1-dodecylazacycloheptan-2-one, an acylcarnitine, an acylcholine, or a monoglyceride, a diglyceride or a pharmaceutically acceptable salt thereof (e.g. sodium). Also preferred are combinations of penetration enhancers, for example, fatty acids/salts in combination with bile acids/salts. A particularly preferred combination is the sodium salt of lauric acid, capric acid and UDCA. Further penetration enhancers include polyoxyethylene-9-lauryl ether, polyoxyethylene-20-cetyl ether. Oligonucleotides of the invention may be delivered orally, in granular form including sprayed dried particles, or complexed to form micro or nanoparticles. Oligonucleotide complexing agents include poly-amino acids; polyimines; polyacrylates; polyalkylacrylates, polyoxethanes, polyalkylcyanoacrylates; cationized gelatins, albumins, starches, acrylates, polyethyleneglycols (PEG) and starches; polyalkylcyanoacrylates; DEAE-derivatized polyimines, pollulans, celluloses and starches. Particularly preferred complexing agents include chitosan, N-trimethylchitosan, poly-L-lysine, polyhistidine, polyornithine, polyspermines, protamine, polyvinylpyridine, polythiodiethylamino-methylethylene P(TDAE), polyaminostyrene (e.g. p-amino), poly(methylcyanoacrylate), poly(ethylcyanoacrylate), poly(butylcyanoacrylate), poly(isobutylcyanoacrylate), poly(isohexylcynaoacrylate), DEAE-methacrylate, DEAE-hexylacrylate, DEAE-acrylamide, DEAE-albumin and DEAE-dextran, polymethylacrylate, polyhexylacrylate, poly(D,L-lactic acid), poly(DL-lactic-co-glycolic acid (PLGA), alginate, and polyethyleneglycol (PEG). Oral formulations for oligonucleotides and their preparation are described in detail in U.S. applications Ser. No. 08/886,829 (filed Jul. 1, 1997), Ser. No. 09/108,673 (filed Jul. 1, 1998), Ser. No. 09/256,515 (filed Feb. 23, 1999), Ser. No. 09/082,624 (filed May 21, 1998) and Ser. No. 09/315,298 (filed May 20, 1999), each of which is incorporated herein by reference in their entirety.

[0071] Compositions and formulations for parenteral, intrathecal or intraventricular administration may include sterile aqueous solutions which may also contain buffers, diluents and other suitable additives such as, but not limited to, penetration enhancers, carrier compounds and other pharmaceutically acceptable carriers or excipients.

[0072] Pharmaceutical compositions of the present invention include, but are not limited to, solutions, emulsions, and liposome-containing formulations. These compositions may be generated from a variety of components that include, but are not limited to, preformed liquids, self-emulsifying solids and self-emulsifying semisolids.

[0073] The pharmaceutical formulations of the present invention, which may conveniently be presented in unit dosage form, may be prepared according to conventional techniques well known in the pharmaceutical industry. Such techniques include the step of bringing into association the active ingredients with the pharmaceutical carrier9s) or excipient(s). In general, the formulations are prepared by uniformly and intimately bringing into association the active ingredients with liquid carriers or finely divided solid carriers or both, and then, if necessary, shaping the product.

[0074] The compositions of the present invention may be formulated into any of many possible dosage forms such as, but not limited to, tablets, capsules, gel capsules, liquid syrups, soft gels, suppositories, and enemas. The compositions of the present invention may also be formulated as suspensions in aqueous, non-aqueous or mixed media. Aqueous suspensions may further contain substances which increase the viscosity of the suspension including, for example, sodium carboxymethylcellulose, sorbitol and/or dextran. The suspension may also contain stabilizers.

[0075] In one embodiment of the present invention the pharmaceutical compositions may be formulated and used as foams. Pharmaceutical foams include formulations such as, but not limited to, emulsions, microemulsions, creams, jellies and liposomes. While basically similar in nature these formulations vary in the components and the consistency of the final product. The preparation of such compositions and formulations is generally known to those skilled in the pharmaceutical and formulation arts and may be applied to the formulation of the compositions of the present invention.

[0076] Emulsions

[0077] The compositions of the present invention may be prepared and formulated as emulsions. Emulsions are typically heterogenous systems of one liquid dispersed in another in the form of droplets usually exceeding 0.1 μm in diameter Idson, in Pharmaceutical Dosage Forms, Lieberman, Rieger and Banker (Eds.), 1988, Marcel Dekker, Inc., New York, N.Y., volume 1, p. 199; Rosoff, in Pharmaceutical Dosage Forms, Lieberman, Rieger and Banker (Eds.), 1988, Marcel Dekker, Inc., New York, N.Y., Volume 1, p. 245; Block in Pharmaceutical Dosage Forms, Lieberman, Rieger and Banker (Eds.), 1988, Marcel Dekker, Inc., New York, N.Y., volume 2, p. 335; Higuchi et al., in Remington's Pharmaceutical Sciences, Mack Publishing Co., Easton, Pa., 1985, p. 301). Emulsions are often biphasic systems comprising two immiscible liquid phases intimately mixed and dispersed with each other. In general, emulsions may be of either the water-in-oil (w/o) or the oil-in-water 9o/w) variety. When an aqueous phase is finely divided into and dispersed as minute droplets into a bulk oily phase, the resulting composition is called a water-in-oil (w/o) emulsion. Alternatively, when an oily phase is finely divided into and dispersed as minute droplets into a bulk aqueous phase, the resulting composition is called an oil-in-water (o/w) emulsion. Emulsions may contain additional components in addition to the dispersed phases, and the active drug which may be present as a solution in either the aqueous phase, oily phase or itself as a separate phase. Pharmaceutical excipients such as emulsifiers, stabilizers, dyes, and anti-oxidants may also be present in emulsions as needed. Pharmaceutical emulsions may also be multiple emulsions that are comprised of more than two phases such as, for example, in the case of oil-in-water-in-oil (o/w/o) and water-in-oil-in-water (w/o/w) emulsions. Such complex formulations often provide certain advantages that simple binary emulsions do not. Multiple emulsions in which individual oil droplets of an o/w emulsion enclose small water droplets constitute a w/o/w emulsion. Likewise a system of oil droplets enclosed in globules of water stabilized in an oily continuous phase provides an o/w/o emulsion.

[0078] Emulsions are characterized by little or no thermodynamic stability. Often, the dispersed or discontinuous phase of the emulsion is well dispersed into the external or continuous phase and maintained in this form through the means of emulsifiers or the viscosity of the formulation. Either of the phases of the emulsion may be a semisolid or a solid, as is the case of emulsion-style ointment bases and creams. Other means of stabilizing emulsions entail the use of emulsifiers that may be incorporated into either phase of the emulsion. Emulsifiers may broadly be classified into four categories: synthetic surfactants, naturally occurring emulsifiers, absorption bases, and finely dispersed solids (Idson, in Pharmaceutical Dosage Forms, Lieberman, Rieger and Banker (Eds.), 1988, Marcel Dekker, Inc., New York, N.Y., volume 1, p. 199).

[0079] Synthetic surfactants, also known as surface active agents, have found wide applicability in the formulation of emulsions and have been reviewed in the literature Rieger, in Phamaceutical Dosage Forms, Lieberman, Rieger and Banker (Eds.), 1988, Marcel Dekker, Inc., New York, N.Y., volume 1, p. 285; Idson, in Phamaceutical Dosage Forms, Lieberman, Rieger and Banker (Eds.), Marcel Dekker, Inc., New York, N.Y., 1988, volume 1, p. 199). Surfactants are typically amphiphilic and comprise a hydrophilic and a hydrophobic portion. The ratio of the hydrophilic to the hydrophobic nature of the surfactant has been termed the hydrophile/lipophile balance (HLB) and is a valuable tool in categorizing and selecting surfactants in the preparation of formulations. Surfactants may be classified into different classes based on the nature of the hydrophilic group: nonionic, anionic, cationic and amphoteric Rieger, in Phamaceutical Dosage Forms, Lieberman, Rieger and Banker (Eds.), 1988, Marcel Dekker, Inc., New York, N.Y., volume 1, p. 285).

[0080] Naturally occurring emulsifiers used in emulsion formulations include lanolin, beeswax, phosphatides, lecithin and acacia. Absorption bases possess hydrophilic properties such that they can soak up water to form w/o emulsions yet retain their semisolid consistencies, such as anhydrous lanolin and hydrophilic petrolatum. Finely divided solids have also been used as good emulsifiers especially in combination with surfactants and in viscous preparations. These include polar inorganic solids, such as heavy metal hydroxides, nonswelling clays such as bentonite, attapulgite, hectorite, kaolin, montmorillonite, colloidal aluminum silicate and colloidal magnesium aluminum silicate, pigments and nonpolar solids such as carbon or glyceryl tristearate.

[0081] A large variety of non-emulsifying materials are also included in emulsion formulations and contribute to the properties of emulsions. These include fats, oils, waxes, fatty acids, fatty alcohols, fatty esters, humectants, hydrophilic colloids, preservatives and antioxidants Block, in Phamaceutical Dosage Forms, Lieberman, Rieger and Banker (Eds.), 1988, Marcel Dekker, Inc., New York, N.Y., volume 1, p. 335; Idson, in Phamaceutical Dosage Forms, Lieberman, Rieger and Banker (Eds.), 1988, Marcel Dekker, Inc., New York, N.Y., volume 1, p. 199).

[0082] Hydrophilic colloids or hydrocolloids include naturally occurring gums and synthetic polymers such as polysaccharides (for example, acacia, agar, alginic acid, carrageenan, guar gum, karaya gum, and tragacanth), cellulose derivatives (for example, carboxymethylcellulose and carboxypropylcellulose), and synthetic polymers (for example, carbomers, cellulose ethers, and carboxyvinyl polymers). These disperse or swell in water to form colloidal solutions that stabilize emulsions by forming strong interfacial films around the dispersed-phase droplets and by increasing the viscosity of the external phase.

[0083] Since emulsions often contain a number of ingredients such as carbohydrates, proteins, sterols and phosphatides that may readily support the growth of microbes, these formulations often incorporate preservatives. Commonly used preservatives included in emulsion formulations include methyl paraben, propyl paraben, quaternary ammonium salts, benzalkonium chloride, esters of p-hydroxybenzoic acid, and boric acid. Antioxidants are also commonly added to emulsion formulations to prevent deterioration of the formulation. Antioxidants used may be free radical scavengers such as tocopherols, alkyl gallates, butylated hydroxyanisole, butylated hydroxytoluene, or reducing agents such as ascorbic acid and sodium metabisulfite, and antioxidant synergists such as citric acid, tartaric acid, and lecithin.

[0084] The application of emulsion formulations via dermatological, oral and parenteral routes and methods for their manufacture have been reviewed in the literature (Idson, in Phamaceutical Dosage Forms, Lieberman, Rieger and Banker (Eds.), 1988, Marcel Dekker, Inc., New York, N.Y., volume 1, p. 199). Emulsion formulations for oral delivery have been very widely used because of ease of formulation, as well as efficacy from an absorption and bioavailability standpoint (Rosoff, in Phamaceutical Dosage Forms, Lieberman, Rieger and Banker (Eds.), 1988, Marcel Dekker, Inc., New York, N.Y., volume 1, p. 245; Idson, in Phamaceutical Dosage Forms, Lieberman, Rieger and Banker (Eds.), 1988, Marcel Dekker, Inc., New York, N.Y., volume 1, p. 199). Mineral-oil base laxatives, oil-soluble vitamins and high fat nutritive preparations are among the materials that have commonly been administered orally as o/w emulsions.

[0085] In one embodiment of the present invention, the compositions of oligonucleotides and nucleic acids are formulated as microemulsions. A microemulsion may be defined as a system of water, oil and amphiphile which is a single optically isotropic and thermodynamically stable liquid solution (Rosoff, in Phamaceutical Dosage Forms, Lieberman, Rieger and Banker (Eds.), 1988, Marcel Dekker, Inc., New York, N.Y., volume 1, p. 245). Typically microemulsions are systems that are prepared by first dispersing an oil in an aqueous surfactant solution and then adding a sufficient amount of a fourth component, generally an intermediate chain-length alcohol to form a transparent system. Therefore, microemulsions have also been described as thermodynamically stable, isotropically clear dispersions of two immiscible liquids that are stabilized by interfacial films of surface-active molecules (Leung and Shah, in: Controlled Release of Drugs: Polymers and Aggregate Systems, Rosoff, M., Ed., 1989, VCH Publishers, New York, pages 185-215). Microemulsions commonly are prepared via a combination of three to five components that include oil, water, surfactant, cosurfactant and electrolyte. Whether the microemulsion is of the water-in-oil (w/o) or an oil-in-water (o/w) type is dependent on the properties of the oil and surfactant used and on the structure and geometric packing of the polar heads and hydrocarbon tails of the surfactant molecules (Schott, in Remington's Pharmaceutical Sciences, Mack Publishing Co., Easton, Pa., 1985, p. 271).

[0086] The phenomenological approach utilizing phase diagrams has been extensively studied and has yielded a comprehensive knowledge, to one skilled in the art, of how to formulate microemulsions (Rosoff, in Phamaceutical Dosage Forms, Lieberman, Rieger and Banker (Eds.), 1988, Marcel Dekker, Inc., New York, N.Y., volume 1, p. 245; Block, in Phamaceutical Dosage Forms, Lieberman, Rieger and Banker (Eds.), 1988, Marcel Dekker, Inc., New York, N.Y., volume 1, p. 335). Compared to conventional emulsions, microemulsions offer the advantage of solubilizing water-insoluble drugs in a formulation of thermodynamically stable droplets that are formed spontaneously.

[0087] Surfactants used in the preparation of microemulsions include, but are not limited to, ionic surfactants, non-ionic surfactants, Brij 96, polyoxyethylene oleyl ethers, polyglycerol fatty acid esters, tetraglycerol monolaurate (ML310), tetraglycerol monooleate (MO310), hexaglycerol monooleate (PO310), hexaglycerol pentaoleate (PO500), decaglycerol monocaprate (MCA750), decaglycerol monooleate (MO750), decaglycerol sequioleate (SO750), decaglycerol decaoleate (DAO750), alone or in combination with cosurfactants. The cosurfactant, usually a short-chain alcohol such as ethanol, 1-propanol, and 1-butanol, serves to increase the interfacial fluidity by penetrating into the surfactant film and consequently creating a disordered film because of the void space generated among surfactant molecules. Microemulsions may, however, be prepared without the use of cosurfactants and alcohol-free self-emulsifying microemulsion systems are known in the art. The aqueous phase may typically be, but is not limited to, water, an aqueous solution of the drug, glycerol, PEG300, PEG400, polyglycerols, propylene glycols, and derivatives of ethylene glycol. The oil phase may include, but is not limited to, materials such as Captex 300, Captex 355, Capmul MCM, fatty acid esters, medium chain (C8-C12) mono, di, and tri-glycerides, polyoxyethylated glyceryl fatty acid esters, fatty alcohols, polyglycolized glycerides, saturated polyglycolized C8-C10glycerides, vegetable oils and silicone oil.

[0088] Microemulsions are particularly of interest from the standpoint of drug solubilization and the enhanced absorption of drugs. Lipid based microemulsions (both o/w and w/o) have been proposed to enhance the oral bioavailability of drugs, including peptides (Constantinides et al., Pharmaceutical Research, 1994, 11, 1385-1390; Ritschel, Meth. Find. Exp. Clin. Pharmacol., 1993, 13, 205). Microemulsions afford advantages of improved drug solubilization, protection of drug from enzymatic hydrolysis, possible enhancement of drug absorption due to surfactant-induced alterations in membrane fluidity and permeability, ease of preparation, ease of oral administration over solid dosage forms, improved clinical potency, and decreased toxicity (Constantinides et al., Pharmaceutical Research, 1994, 11, 1385; Ho et al., J. Pharm. Sci., 1996, 85, 138-143). Often microemulsions may form spontaneously when their components are brought together at ambient temperature. This may be particularly advantageous when formulating thermolabile drugs, peptides or oligonucleotides. Microemulsions have also been effective in the transdermal delivery of active components in both cosmetic and pharmaceutical applications. It is expected that the microemulsion compositions and formulations of the present invention will facilitate the increased systemic absorption of oligonucleotides and nucleic acids from the gastrointestinal tract, as well as improve the local cellular uptake of oligonucleotides and nucleic acids within the gastrointestinal tract, vagina, buccal cavity and other areas of administration.

[0089] Microemulsions of the present invention may also contain additional components and additives such as sorbitan monostearate (Grill 3), Labrasol, and penetration enhancers to improve the properties of the formulation and to enhance the absorption of the oligonucleotides and nucleic acids of the present invention. Penetration enhancers used in the microemulsions of the present invention may be classified as belonging to one of five broad categories—surfactants, fatty acids, bile salts, chelating agents, and non-chelating non-surfactants (Lee et al., Critical Reviews in Therapeutic Drug Carrier Systems, 1991, p. 92). Each of these classes has been discussed above.

[0090] Liposomes

[0091] There are many organized surfactant structures besides microemulsions that have been studied and used for the formulation of drugs. These include monolayers, micelles, bilayers and vesicles. Vesicles, such as liposomes, have attracted great interest because of their specificity and the duration of action they offer from the standpoint of drug delivery. As used in the present invention, the term “liposome” means a vesicle composed of amphiphilic lipids arranged in a spherical bilayer or bilayers.

[0092] Liposomes are unilamellar or multilamellar vesicles which have a membrane formed from a lipophilic material and an aqueous interior. The aqueous portion contains the composition to be delivered. Cationic liposomes possess the advantage of being able to fuse to the cell wall. Non-cationic liposomes, although not able to fuse as efficiently with the cell wall, are taken up by macrophages in vivo.

[0093] In order to cross intact mammalian skin, lipid vesicles must pass through a series of fine pores, each with a diameter less than 50 nm, under the influence of a suitable transdermal gradient. Therefore, it is desirable to use a liposome which is highly deformable and able to pass through such fine pores.

[0094] Further advantages of liposomes include; liposomes obtained from natural phospholipids are biocompatible and biodegradable; liposomes can incorporate a wide range of water and lipid soluble drugs; liposomes can protect encapsulated drugs in their internal compartments from metabolism and degradation (Rosoff, in Pharmaceutical Dosage Forms, Lieberman, Rieger and Banker (Eds.), 1988, Marcel Dekker, Inc., New York, N.Y., volume 1, p. 245). Important considerations in the preparation of liposome formulations are the lipid surface charge, vesicle size and the aqueous volume of the liposomes.

[0095] Liposomes are useful for the transfer and delivery of active ingredients to the site of action. Because the liposomal membrane is structurally similar to biological membranes, when liposomes are applied to a tissue, the liposomes start to merge with the cellular membranes and as the merging of the liposome and cell progresses, the liposomal contents are emptied into the cell where the active agent may act.

[0096] Liposomal formulations have been the focus of extensive investigation as the mode of delivery for many drugs. There is growing evidence that for topical administration, liposomes present several advantages over other formulations. Such advantages include reduced side-effects related to high systemic absorption of the administered drug, increased accumulation of the administered drug at the desired target, and the ability to administer a wide variety of drugs, both hydrophilic and hydrophobic, into the skin.

[0097] Several reports have detailed the ability of liposomes to deliver agents including high-molecular weight DNA into the skin. Compounds including analgesics, antibodies, hormones and high-molecular weight DNAs have been administered to the skin. The majority of applications resulted in the targeting of the upper epidermis.

[0098] Liposomes fall into two broad classes. Cationic liposomes are positively charged liposomes which interact with the negatively charged DNA molecules to form a stable complex. The positively charged DNA/liposome complex binds to the negatively charged cell surface and is internalized in an endosome. Due to the acidic pH within the endosome, the liposomes are ruptured, releasing their contents into the cell cytoplasm (Wang et al., Biochem. Biophys. Res. Commun., 1987, 147, 980-985).

[0099] Liposomes which are pH-sensitive or negatively-charged, entrap DNA rather than complex with it. Since both the DNA and the lipid are similarly charged, repulsion rather than complex formation occurs. Nevertheless, some DNA is entrapped within the aqueous interior of these liposomes. pH-sensitive liposomes have been used to deliver DNA encoding the thymidine kinase gene to cell monolayers in culture. Expression of the exogenous gene was detected in the target cells (Zhou et al., Journal of Controlled Release, 1992, 19, 269-274).

[0100] One major type of liposomal composition includes phospholipids other than naturally-derived phosphatidylcholine. Neutral liposome compositions, for example, can be formed from dimyristoyl phosphatidylcholine (DMPC) or dipalmitoyl phosphatidylcholine (DPPC). Anionic liposome compositions generally are formed from dimyristoyl phosphatidylglycerol, while anionic fusogenic liposomes are formed primarily from dioleoyl phosphatidylethanolamine (DOPE). Another type of liposomal composition is formed from phosphatidylcholine (PC) such as, for example, soybean PC, and egg PC. Another type is formed from mixtures of phospholipid and/or phosphatidylcholine and/or cholesterol.

[0101] Several studies have assessed the topical delivery of liposomal drug formulations to the skin. Application of liposomes containing interferon to guinea pig skin resulted in a reduction of skin herpes sores while delivery of interferon via other means (e.g. as a solution or as an emulsion) were ineffective Weiner et al., Journal of Drug Targeting, 1992, 2, 405-410). Further, an additional study tested the efficacy of interferon administered as part of a liposomal formulation to the administration of interferon using an aqueous system, and concluded that the liposomal formulation was superior to aqueous administration (du Plessis et al., Antiviral Research, 1992, 18, 259-265).

[0102] Non-ionic liposomal systems have also been examined to determine their utility in the delivery of drugs to the skin, in particular systems comprising non-ionic surfactant and cholesterol. Non-ionic liposomal formulations comprising Novasome™ I (glyceryl dilaurate/cholesterol/polyoxyethylene-10-stearyl ether) and Novasome™ II (glyceryl distearate/cholesterol/polyoxyethylene-10-stearyl ether) were used to deliver cyclosporin-A into the dermis of mouse skin. Results indicated that such non-ionic liposomal systems were effective in facilitating the deposition of cyclosporin-A into different layers of the skin Hu et al. S.T.P. Pharma. Sci., 1994, 4, 6, 466).

[0103] Liposomes also include “sterically stabilized” liposomes, a term which, as used herein, refers to liposomes comprising one or more specialized lipids that, when incorporated into liposomes, result in enhanced circulation lifetimes relative to liposomes lacking such specialized lipids. Examples of sterically stabilized liposomes are those in which part of the vesicle-forming lipid portion of the liposome (A) comprises one or more glycolipids, such as monosialoganglioside G_(M1), or (B) is derivatized with one or more hydrophilic polymers, such as a polyethylene glycol (PEG) moiety. While not wishing to be bound by any particular theory, it is thought in the art that, at least for sterically stabilized liposomes containing gangliosides, sphingomyelin, or PEG-derivatized lipids, the enhanced circulation half-life of these sterically stabilized liposomes derives from a reduced uptake into cells of the reticuloendothelial system (RES) (Allen et al., FEBS Letters, 1987, 223, 42; Wu et al., Cancer Research, 1993, 53, 3765).

[0104] Various liposomes comprising one or more glycolipids are known in the art. Papahadjopoulos et al. (Ann. N.Y. Acad. Sci., 1987, 507, 64) reported the ability of monosialoganglioside G_(M1), galactocerebroside sulfate and phosphatidylinositol to improve blood half-lives of liposomes. These findings were expounded upon by Gabizon et al. (Proc. Natl. Acad. Sci. U.S.A., 1988, 85, 6949). U.S. Pat. No. 4,837,028 and WO 88/04924, both to Allen et al., disclose liposomes comprising (1) sphingomyelin and (2) the ganglioside G_(M1) or a galactocerebroside sulfate ester. U.S. Pat. No. 5,543,152 (Webb et al.) discloses liposomes comprising sphingomyelin. Liposomes comprising 1,2-sn-dimyristoylphosphatidylcholine are disclosed in WO 97/13499 (Lim et al.).

[0105] Many liposomes comprising lipids derivatized with one or more hydrophilic polymers, and methods of preparation thereof, are known in the art. Sunamoto et al. (Bull. Chem. Soc. Jpn., 1980, 53, 2778) described liposomes comprising a nonionic detergent, 2C₁₂15G, that contains a PEG moiety. Illum et al. (FEBS Lett., 1984, 167, 79) noted that hydrophilic coating of polystyrene particles with polymeric glycols results in significantly enhanced blood half-lives. Synthetic phospholipids modified by the attachment of carboxylic groups of polyalkylene glycols (e.g., PEG) are described by Sears (U.S. Pat. Nos. 4,426,330 and 4,534,899). Klibanov et al. FEBS Lett., 1990, 268, 235) described experiments demonstrating that liposomes comprising phosphatidylethanolamine (PE) derivatized with PEG or PEG stearate have significant increases in blood circulation half-lives. Blume et al. (Biochimica et Biophysica Acta, 1990, 1029, 91) extended such observations to other PEG-derivatized phospholipids, e.g., DSPE-PEG, formed from the combination of distearoylphosphatidylethanolamine (DSPE) and PEG. Liposomes having covalently bound PEG moieties on their external surface are described in European Patent No. EP 0 445 131 B1 and WO 90/04384 to Fisher. Liposome compositions containing 1-20 mole percent of PE derivatized with PEG, and methods of use thereof, are described by Woodle et al. (U.S. Pat. Nos. 5,013,556 and 5,356,633) and Martin et al. (U.S. Pat. No. 5,213,804 and European Patent No. EP 0 496 813 B1). Liposomes comprising a number of other lipid-polymer conjugates are disclosed in WO 91/05545 and U.S. Pat. No. 5,225,212 (both to Martin et al.) and in WO 94/20073 (Zalipsky et al.) Liposomes comprising PEG-modified ceramide lipids are described in WO 96/10391 (Choi et al.). U.S. Pat. No. 5,540,935 (Miyazaki et al.) and U.S. Pat. No. 5,556,948 (Tagawa et al.) describe PEG-containing liposomes that can be further derivatized with functional moieties on their surfaces.

[0106] A limited number of liposomes comprising nucleic acids are known in the art. WO 96/40062 to Thierry et al. discloses methods for encapsulating high molecular weight nucleic acids in liposomes. U.S. Pat. No. 5,264,221 to Tagawa et al. discloses protein-bonded liposomes and asserts that the contents of such liposomes may include an antisense RNA. U.S. Pat. No. 5,665,710 to Rahman et al. describes certain methods of encapsulating oligodeoxynucleotides in liposomes. WO 97/04787 to Love et al. discloses liposomes comprising antisense oligonucleotides targeted to the raf gene.

[0107] Transfersomes are yet another type of liposomes, and are highly deformable lipid aggregates which are attractive candidates for drug delivery vehicles. Transfersomes may be described as lipid droplets which are so highly deformable that they are easily able to penetrate through pores which are smaller than the droplet. Transfersomes are adaptable to the environment in which they are used, e.g. they are self-optimizing (adaptive to the shape of pores in the skin), self-repairing, frequently reach their targets without fragmenting, and often self-loading. To make transfersomes it is possible to add surface edge-activators, usually surfactants, to a standard liposomal composition. Transfersomes have been used to deliver serum albumin to the skin. The transfersome-mediated delivery of serum albumin has been shown to be as effective as subcutaneous injection of a solution containing serum albumin.

[0108] Surfactants find wide application in formulations such as emulsions (including microemulsions) and liposomes. The most common way of classifying and ranking the properties of the many different types of surfactants, both natural and synthetic, is by the use of the hydrophile/lipophile balance (HLB). The nature of the hydrophilic group (also known as the “head”) provides the most useful means for categorizing the different surfactants used in formulations (Rieger, in Phamaceutical Dosage Forms, Marcel Dekker, Inc., New York, N.Y., 1988, p. 285).

[0109] If the surfactant molecule is not ionized, it is classified as a nonionic surfactant. Nonionic surfactants find wide application in pharmaceutical and cosmetic products and are usable over a wide range of pH values. In general their HLB values range from 2 to about 18 depending on their structure. Nonionic surfactants include nonionic esters such as ethylene glycol esters, propylene glycol esters, glyceryl esters, polyglyceryl esters, sorbitan esters, sucrose esters, and ethoxylated esters. Nonionic alkanolamides and ethers such as fatty alcohol ethoxylates, propoxylated alcohols, and ethoxylated/propoxylated block polymers are also included in this class. The polyoxyethylene surfactants are the most popular members of the nonionic surfactant class.

[0110] If the surfactant molecule carries a negative charge when it is dissolved or dispersed in water, the surfactant is classified as anionic. Anionic surfactants include carboxylates such as soaps, acyl lactylates, acyl amides of amino acids, esters of sulfuric acid such as alkyl sulfates and ethoxylated alkyl sulfates, sulfonates such as alkyl benzene sulfonates, acyl isethionates, acyl taurates and sulfosuccinates, and phosphates. The most important members of the anionic surfactant class are the alkyl sulfates and the soaps.

[0111] If the surfactant molecule carries a positive charge when it is dissolved or dispersed in water, the surfactant is classified as cationic. Cationic surfactants include quaternary ammonium salts and ethoxylated amines. The quaternary ammonium salts are the most used members of this class.

[0112] If the surfactant molecule has the ability to carry either a positive or negative charge, the surfactant is classified as amphoteric. Amphoteric surfactants include acrylic acid derivatives, substituted alkylamides, N-alkylbetaines and phosphatides.

[0113] The use of surfactants in drug products, formulations and in emulsions has been reviewed (Rieger, in Pharmaceutical Dosage Forms, Marcel Dekker, Inc., New York, N.Y., 1988, p. 285).

[0114] Penetration Enhancers

[0115] In one embodiment, the present invention employs various penetration enhancers to effect the efficient delivery of nucleic acids, particularly oligonucleotides, to the skin of animals. Most drugs are present in solution in both ionized and nonionized forms. However, usually only lipid soluble or lipophilic drugs readily cross cell membranes. It has been discovered that even non-lipophilic drugs may cross cell membranes if the membrane to be crossed is treated with a penetration enhancer. In addition to aiding the diffusion of non-lipophilic drugs across cell membranes, penetration enhancers also enhance the permeability of lipophilic drugs.

[0116] Penetration enhancers may be classified as belonging to one of five broad categories, i.e., surfactants, fatty acids, bile salts, chelating agents, and non-chelating non-surfactants (Lee et al., Critical Reviews in Therapeutic Drug Carrier Systems, 1991, p. 92). Each of the above mentioned classes of penetration enhancers are described below in greater detail.

[0117] Surfactants: In connection with the present invention, surfactants (or “surface-active agents”) are chemical entities which, when dissolved in an aqueous solution, reduce the surface tension of the solution or the interfacial tension between the aqueous solution and another liquid, with the result that absorption of oligonucleotides through the mucosa is enhanced. In addition to bile salts and fatty acids, these penetration enhancers include, for example, sodium lauryl sulfate, polyoxyethylene-9-lauryl ether and polyoxyethylene-20-cetyl ether) (Lee et al., Critical Reviews in Therapeutic Drug Carrier Systems, 1991, p. 92); and perfluorochemical emulsions, such as FC-43. Takahashi et al., J. Pharm. Pharmacol., 1988, 40, 252).

[0118] Fatty acids: Various fatty acids and their derivatives which act as penetration enhancers include, for example, oleic acid, lauric acid, capric acid (n-decanoic acid), myristic acid, palmitic acid, stearic acid, linoleic acid, linolenic acid, dicaprate, tricaprate, monoolein (1-monooleoyl-rac-glycerol), dilaurin, caprylic acid, arachidonic acid, glycerol 1-monocaprate, 1-dodecylazacycloheptan-2-one, acylcarnitines, acylcholines, C₁₋₁₀ alkyl esters thereof (e.g., methyl, isopropyl and t-butyl), and mono- and di-glycerides thereof (i.e., oleate, laurate, caprate, myristate, palmitate, stearate, linoleate, etc.) (Lee et al., Critical Reviews in Therapeutic Drug Carrier Systems, 1991, p. 92; Muranishi, Critical Reviews in Therapeutic Drug Carrier Systems, 1990, 7, 1-33; El Hariri et al., J. Pharm. Pharmacol., 1992, 44, 651-654).

[0119] Bile salts: The physiological role of bile includes the facilitation of dispersion and absorption of lipids and fat-soluble vitamins (Brunton, Chapter 38 in: Goodman & Gilman's The Pharmacological Basis of Therapeutics, 9th Ed., Hardman et al. (Eds., McGraw-Hill, New York, 1996, pp. 934-935). Various natural bile salts, and their synthetic derivatives, act as penetration enhancers. Thus the term “bile salts” includes any of the naturally occurring components of bile as well as any of their synthetic derivatives. The bile salts of the invention include, for example, cholic acid (or its pharmaceutically acceptable sodium salt, sodium cholate), dehydrocholic acid (sodium dehydrocholate), deoxycholic acid (sodium deoxycholate), glucholic acid (sodium glucholate), glycholic acid (sodium glycocholate), glycodeoxycholic acid (sodium glycodeoxycholate), taurocholic acid (sodium taurocholate), taurodeoxycholic acid (sodium taurodeoxycholate), chenodeoxycholic acid (sodium chenodeoxycholate), ursodeoxycholic acid (UDCA), sodium tauro-24,25-dihydro-fusidate (STDHF), sodium glycodihydrofusidate and polyoxyethylene-9-lauryl ether (POE) (Lee et al., Critical Reviews in Therapeutic Drug Carrier Systems, 1991, page 92; Swinyard, Chapter 39 In: Remington's Pharmaceutical Sciences, 18th Ed., Gennaro, ed., Mack Publishing Co., Easton, Pa., 1990, pages 782-783; Muranishi, Critical Reviews in Therapeutic Drug Carrier Systems, 1990, 7, 1-33; Yamamoto et al., J. Pharm. Exp. Ther., 1992, 263, 25; Yamashita et al., J. Pharm. Sci., 1990, 79, 579-583).

[0120] Chelating Agents: Chelating agents, as used in connection with the present invention, can be defined as compounds that remove metallic ions from solution by forming complexes therewith, with the result that absorption of oligonucleotides through the mucosa is enhanced. With regards to their use as penetration enhancers in the present invention, chelating agents have the added advantage of also serving as DNase inhibitors, as most characterized DNA nucleases require a divalent metal ion for catalysis and are thus inhibited by chelating agents (Jarrett, J. Chromatogr., 1993, 618, 315-339). Chelating agents of the invention include but are not limited to disodium ethylenediaminetetraacetate (EDTA), citric acid, salicylates (e.g., sodium salicylate, 5-methoxysalicylate and homovanilate), N-acyl derivatives of collagen, laureth-9 and N-amino acyl derivatives of beta-diketones (enamines)(Lee et al., Critical Reviews in Therapeutic Drug Carrier Systems, 1991, page 92; Muranishi, Critical Reviews in Therapeutic Drug Carrier Systems, 1990, 7, 1-33; Buur et al., J. Control Rel., 1990, 14, 43-51).

[0121] Non-chelating non-surfactants: As used herein, non-chelating non-surfactant penetration enhancing compounds can be defined as compounds that demonstrate insignificant activity as chelating agents or as surfactants but that nonetheless enhance absorption of oligonucleotides through the alimentary mucosa (Muranishi, Critical Reviews in Therapeutic Drug Carrier Systems, 1990, 7, 1-33). This class of penetration enhancers include, for example, unsaturated cyclic ureas, 1-alkyl- and 1-alkenylazacyclo-alkanone derivatives (Lee et al., Critical Reviews in Therapeutic Drug Carrier Systems, 1991, page 92); and non-steroidal anti-inflammatory agents such as diclofenac sodium, indomethacin and phenylbutazone (Yamashita et al., J. Pharm. Pharmacol., 1987, 39, 621-626).

[0122] Agents that enhance uptake of oligonucleotides at the cellular level may also be added to the pharmaceutical and other compositions of the present invention. For example, cationic lipids, such as lipofectin (Junichi et al, U.S. Pat. No. 5,705,188), cationic glycerol derivatives, and polycationic molecules, such as polylysine (Lollo et al., PCT Application WO 97/30731), are also known to enhance the cellular uptake of oligonucleotides.

[0123] Other agents may be utilized to enhance the penetration of the administered nucleic acids, including glycols such as ethylene glycol and propylene glycol, pyrrols such as 2-pyrrol, azones, and terpenes such as limonene and menthone.

[0124] Carriers

[0125] Certain compositions of the present invention also incorporate carrier compounds in the formulation. As used herein, “carrier compound” or “carrier” can refer to a nucleic acid, or analog thereof, which is inert (i.e., does not possess biological activity per se) but is recognized as a nucleic acid by in vivo processes that reduce the bioavailability of a nucleic acid having biological activity by, for example, degrading the biologically active nucleic acid or promoting its removal from circulation. The coadministration of a nucleic acid and a carrier compound, typically with an excess of the latter substance, can result in a substantial reduction of the amount of nucleic acid recovered in the liver, kidney or other extracirculatory reservoirs, presumably due to competition between the carrier compound and the nucleic acid for a common receptor. For example, the recovery of a partially phosphorothioate oligonucleotide in hepatic tissue can be reduced when it is coadministered with polyinosinic acid, dextran sulfate, polycytidic acid or 4-acetamido-4′isothiocyano-stilbene-2,2′-disulfonic acid (Miyao et al., Antisense Res. Dev., 1995, 5, 115-121; Takakura et al., Antisense & Nucl. Acid Drug Dev., 1996, 6, 177-183).

[0126] Excipients

[0127] In contrast to a carrier compound, a “pharmaceutical carrier” or “excipient” is a pharmaceutically acceptable solvent, suspending agent or any other pharmacologically inert vehicle for delivering one or more nucleic acids to an animal. The excipient may be liquid or solid and is selected, with the planned manner of administration in mind, so as to provide for the desired bulk, consistency, etc., when combined with a nucleic acid and the other components of a given pharmaceutical composition. Typical pharmaceutical carriers include, but are not limited to, binding agents (e.g., pregelatinized maize starch, polyvinylpyrrolidone or hydroxypropyl methylcellulose, etc.); fillers (e.g., lactose and other sugars, microcrystalline cellulose, pectin, gelatin, calcium sulfate, ethyl cellulose, polyacrylates or calcium hydrogen phosphate, etc.); lubricants (e.g., magnesium stearate, talc, silica, colloidal silicon dioxide, stearic acid, metallic stearates, hydrogenated vegetable oils, corn starch, polyethylene glycols, sodium benzoate, sodium acetate, etc.); disintegrants (e.g., starch, sodium starch glycolate, etc.); and wetting agents (e.g., sodium lauryl sulphate, etc.).

[0128] Pharmaceutically acceptable organic or inorganic excipient suitable for non-parenteral administration which do not deleteriously react with nucleic acids can also be used to formulate the compositions of the present invention. Suitable pharmaceutically acceptable carriers include, but are not limited to, water, salt solutions, alcohols, polyethylene glycols, gelatin, lactose, amylose, magnesium stearate, talc, silicic acid, viscous paraffin, hydroxymethylcellulose, polyvinylpyrrolidone and the like.

[0129] Formulations for topical administration of nucleic acids may include sterile and non-sterile aqueous solutions, non-aqueous solutions in common solvents such as alcohols, or solutions of the nucleic acids in liquid or solid oil bases. The solutions may also contain buffers, diluents and other suitable additives. Pharmaceutically acceptable organic or inorganic excipients suitable for non-parenteral administration which do not deleteriously react with nucleic acids can be used.

[0130] Suitable pharmaceutically acceptable excipients include, but are not limited to, water, salt solutions, alcohol, polyethylene glycols, gelatin, lactose, amylose, magnesium stearate, talc, silicic acid, viscous paraffin, hydroxymethylcellulose, polyvinylpyrrolidone and the like.

[0131] Other Components

[0132] The compositions of the present invention may additionally contain other adjunct components conventionally found in pharmaceutical compositions, at their art-established usage levels. Thus, for example, the compositions may contain additional, compatible, pharmaceutically-active materials such as, for example, antipruritics, astringents, local anesthetics or anti-inflammatory agents, or may contain additional materials useful in physically formulating various dosage forms of the compositions of the present invention, such as dyes, flavoring agents, preservatives, antioxidants, opacifiers, thickening agents and stabilizers. However, such materials, when added, should not unduly interfere with the biological activities of the components of the compositions of the present invention. The formulations can be sterilized and, if desired, mixed with auxiliary agents, e.g., lubricants, preservatives, stabilizers, wetting agents, emulsifiers, salts for influencing osmotic pressure, buffers, colorings, flavorings and/or aromatic substances and the like which do not deleteriously interact with the nucleic acid(s) of the formulation.

[0133] Aqueous suspensions may contain substances which increase the viscosity of the suspension including, for example, sodium carboxymethylcellulose, sorbitol and/or dextran. The suspension may also contain stabilizers.

[0134] Certain embodiments of the invention provide pharmaceutical compositions containing (a) one or more antisense compounds and (b) one or more other chemotherapeutic agents which function by a non-antisense mechanism. Examples of such chemotherapeutic agents include but are not limited to daunorubicin, daunomycin, dactinomycin, doxorubicin, epirubicin, idarubicin, esorubicin, bleomycin, mafosfamide, ifosfamide, cytosine arabinoside, bis-chloroethylnitrosurea, busulfan, mitomycin C, actinomycin D, mithramycin, prednisone, hydroxyprogesterone, testosterone, tamoxifen, dacarbazine, procarbazine, hexamethylmelamine, pentamethylmelamine, mitoxantrone, amsacrine, chlorambucil, methylcyclohexylnitrosurea, nitrogen mustards, melphalan, cyclophosphamide, 6-mercaptopurine, 6-thioguanine, cytarabine, 5-azacytidine, hydroxyurea, deoxycoformycin, 4-hydroxyperoxycyclophosphoramide, 5-fluorouracil (5-FU), 5-fluorodeoxyuridine (5-FUdR), methotrexate (MTX), colchicine, taxol, vincristine, vinblastine, etoposide (VP-16), trimetrexate, irinotecan, topotecan, gemcitabine, teniposide, cisplatin and diethylstilbestrol (DES). See, generally, The Merck Manual of Diagnosis and Therapy, 15th Ed. 1987, pp. 1206-1228, Berkow et al., eds., Rahway, N.J. When used with the compounds of the invention, such chemotherapeutic agents may be used individually (e.g., 5-FU and oligonucleotide), sequentially e.g., (5-FU and oligonucleotide for a period of time followed by MTX and oligonucleotide), or in combination with one or more other such chemotherapeutic agents (e.g., 5-FU, MTX and oligonucleotide, or 5-FU, radiotherapy and oligonucleotide). Anti-inflammatory drugs, including but not limited to nonsteroidal anti-inflammatory drugs and corticosteroids, and antiviral drugs, including but not limited to ribivirin, vidarabine, acyclovir and ganciclovir, may also be combined in compositions of the invention. See, generally, The Merck Manual of Diagnosis and Therapy, 15th Ed., Berkow et al., eds., 1987, Rahway, N.J., pages 2499-2506 and 46-49, respectively). Other non-antisense chemotherapeutic agents are also within the scope of this invention. Two or more combined compounds may be used together or sequentially.

[0135] In another related embodiment, compositions of the invention may contain one or more antisense compounds, particularly oligonucleotides, targeted to a first nucleic acid and one or more additional antisense compounds targeted to a second nucleic acid target. Numerous examples of antisense compounds are known in the art. Two or more combined compounds may be used together or sequentially.

[0136] The formulation of therapeutic compositions and their subsequent administration is believed to be within the skill of those in the art. Dosing is dependent on severity and responsiveness of the disease state to be treated, with the course of treatment lasting from several days to several months, or until a cure is effected or a diminution of the disease state is achieved. Optimal dosing schedules can be calculated from measurements of drug accumulation in the body of the patient. Persons of ordinary skill can easily determine optimum dosages, dosing methodologies and repetition rates. Optimum dosages may vary depending on the relative potency of individual oligonucleotides, and can generally be estimated based on EC₅₀s found to be effective in in vitro and in vivo animal models. In general, dosage is from 0.01 ug to 100 g per kg of body weight, and may be given once or more daily, weekly, monthly or yearly, or even once every 2 to 20 years. Persons of ordinary skill in the art can easily estimate repetition rates for dosing based on measured residence times and concentrations of the drug in bodily fluids or tissues. Following successful treatment, it may be desirable to have the patient undergo maintenance therapy to prevent the recurrence of the disease state, wherein the oligonucleotide is administered in maintenance doses, ranging from 0.01 ug to 100 g per kg of body weight, once or more daily, to once every 20 years.

[0137] While the present invention has been described with specificity in accordance with certain of its preferred embodiments, the following examples serve only to illustrate the invention and are not intended to limit the same.

EXAMPLES Example 1

[0138] Nucleoside Phosphoramidites for Oligonucleotide Synthesis Deoxy and 2′-alkoxy amidites

[0139] 2′-Deoxy and 2′-methoxy beta-cyanoethyldiisopropyl phosphoramidites were purchased from commercial sources (e.g. Chemgenes, Needham Mass. or Glen Research, Inc. Sterling Va.). Other 2′-O-alkoxy substituted nucleoside amidites are prepared as described in U.S. Pat. No. 5,506,351, herein incorporated by reference. For oligonucleotides synthesized using 2′-alkoxy amidites, optimized synthesis cycles were developed that incorporate multiple steps coupling longer wait times relative to standard synthesis cycles.

[0140] The following abbreviations are used in the text: thin layer chromatography (TLC), melting point (MP), high pressure liquid chromatography (HPLC), Nuclear Magnetic Resonance (NMR), argon (Ar), methanol (MeOH), dichloromethane (CH₂Cl₂), triethylamine (TEA), dimethyl formamide (DMF), ethyl acetate (EtOAc), dimethyl sulfoxide (DMSO), tetrahydrofuran (THF).

[0141] Oligonucleotides containing 5-methyl-2′-deoxycytidine (5-Me-dC) nucleotides were synthesized according to published methods (Sanghvi, et. al., Nucleic Acids Research, 1993, 21, 3197-3203) using commercially available phosphoramidites (Glen Research, Sterling Va. or ChemGenes, Needham Mass.) or prepared as follows:

[0142] Preparation of 5′-O-Dimethoxytrityl-thymidine Intermediate for 5-methyl dC amidite

[0143] To a 50 L glass reactor equipped with air stirrer and Ar gas line was added thymidine (1.00 kg, 4.13 mol) in anhydrous pyridine (6 L) at ambient temperature. Dimethoxytrityl (DMT) chloride (1.47 kg, 4.34 mol, 1.05 eq) was added as a solid in four portions over 1 h. After 30 min, TLC indicated approx. 95% product, 2% thymidine, 5% DMT reagent and by-products and 2% 3′,5′-bis DMT product (R_(f) in EtOAc 0.45, 0.05, 0.98, 0.95 respectively). Saturated sodium bicarbonate (4 L) and CH₂Cl₂ were added with stirring (pH of the aqueous layer 7.5). An additional 18 L of water was added, the mixture was stirred, the phases were separated, and the organic layer was transferred to a second 50 L vessel. The aqueous layer was extracted with additional CH₂Cl₂ (2×2 L). The combined organic layer was washed with water (10 L) and then concentrated in a rotary evaporator to approx. 3.6 kg total weight. This was redissolved in CH₂Cl₂ (3.5 L), added to the reactor followed by water (6 L) and hexanes (13 L). The mixture was vigorously stirred and seeded to give a fine white suspended solid starting at the interface. After stirring for 1 h, the suspension was removed by suction through a ½″ diameter teflon tube into a 20 L suction flask, poured onto a 25 cm Coors Buchner funnel, washed with water (2×3 L) and a mixture of hexanes—CH₂Cl₂ (4:1, 2×L) and allowed to air dry overnight in pans (1″ deep). This was further dried in a vacuum oven (75° C., 0.1 mm Hg, 48 h) to a constant weight of 2072 g (93%) of a white solid, (mp 122-124° C.). TLC indicated a trace contamination of the bis DMT product. NMR spectroscopy also indicated that 1-2 mole percent pyridine and about 5 mole percent of hexanes was still present.

[0144] Preparation of 5′-O-Dimethoxytrityl-2′-deoxy-5-methylcytidine Intermediate for 5-methyl-dC amidite

[0145] To a 50 L Schott glass-lined steel reactor equipped with an electric stirrer, reagent addition pump (connected to an addition funnel), heating/cooling system, internal thermometer and an Ar gas line was added 5′-O-dimethoxytrityl-thymidine (3.00 kg, 5.51 mol), anhydrous acetonitrile (25 L) and TEA (12.3 L, 88.4 mol, 16 eq). The mixture was chilled with stirring to −10° C. internal temperature (external −20° C.). Trimethylsilylchloride (2.1 L, 16.5 mol, 3.0 eq) was added over 30 minutes while maintaining the internal temperature below −5° C., followed by a wash of anhydrous acetonitrile (1 L). Note: the reaction is mildly exothermic and copious hydrochloric acid fumes form over the course of the addition. The reaction was allowed to warm to 0° C. and the reaction progress was confirmed by TLC (EtOAc-hexanes 4:1; R_(f) 0.43 to 0.84 of starting material and silyl product, respectively). Upon completion, triazole (3.05 kg, 44 mol, 8.0 eq) was added the reaction was cooled to −20° C. internal temperature (external −30° C.). Phosphorous oxychloride (1035 mL, 11.1 mol, 2.01 eq) was added over 60 min so as to maintain the temperature between −20° C. and −10° C. during the strongly exothermic process, followed by a wash of anhydrous acetonitrile (1 L). The reaction was warmed to 0° C. and stirred for 1 h. TLC indicated a complete conversion to the triazole product (R_(f) 0.83 to 0.34 with the product spot glowing in long wavelength UV light). The reaction mixture was a peach-colored thick suspension, which turned darker red upon warming without apparent decomposition. The reaction was cooled to −15° C. internal temperature and water (5 L) was slowly added at a rate to maintain the temperature below +10° C. in order to quench the reaction and to form a homogenous solution. (Caution: this reaction is initially very strongly exothermic). Approximately one-half of the reaction volume (22 L) was transferred by air pump to another vessel, diluted with EtOAc (12 L) and extracted with water (2×8 L). The combined water layers were back-extracted with EtOAc (6 L). The water layer was discarded and the organic layers were concentrated in a 20 L rotary evaporator to an oily foam. The foam was coevaporated with anhydrous acetonitrile (4 L) to remove EtOAc. (note: dioxane may be used instead of anhydrous acetonitrile if dried to a hard foam). The second half of the reaction was treated in the same way. Each residue was dissolved in dioxane (3 L) and concentrated ammonium hydroxide (750 mL) was added. A homogenous solution formed in a few minutes and the reaction was allowed to stand overnight (although the reaction is complete within 1 h).

[0146] TLC indicated a complete reaction (product R_(f) 0.35 in EtOAc-MeOH 4:1). The reaction solution was concentrated on a rotary evaporator to a dense foam. Each foam was slowly redissolved in warm EtOAc (4 L; 50° C.); combined in a 50 L glass reactor vessel, and extracted with water (2×4L) to remove the triazole by-product. The water was back-extracted with EtOAc (2 L). The organic layers were combined and concentrated to about 8 kg total weight, cooled to 0° C. and seeded with crystalline product. After 24 hours, the first crop was collected on a 25 cm Coors Buchner funnel and washed repeatedly with EtOAc (3×3L) until a white powder was left and then washed with ethyl ether (2×3L). The solid was put in pans (1″ deep) and allowed to air dry overnight. The filtrate was concentrated to an oil, then redissolved in EtOAc (2 L), cooled and seeded as before. The second crop was collected and washed as before (with proportional solvents) and the filtrate was first extracted with water 2×L) and then concentrated to an oil. The residue was dissolved in EtOAc (1 L) and yielded a third crop which was treated as above except that more washing was required to remove a yellow oily layer.

[0147] After air-drying, the three crops were dried in a vacuum oven (50° C., 0.1 mm Hg, 24 h) to a constant weight (1750, 600 and 200 g, respectively) and combined to afford 2550 g (85%) of a white crystalline product (MP 215-217° C.) when TLC and NMR spectroscopy indicated purity. The mother liquor still contained mostly product (as determined by TLC) and a small amount of triazole (as determined by NMR spectroscopy), bis DMT product and unidentified minor impurities. If desired, the mother liquor can be purified by silica gel chromatography using a gradient of MeOH (0-25%) in EtOAc to further increase the yield.

[0148] Preparation of 5′-O-Dimethoxytrityl-2′-deoxy-N4-benzoyl-5-methylcytidine Penultimate Intermediate for 5-methyl dC amidite

[0149] Crystalline 5′-O-dimethoxytrityl-5-methyl-2′-deoxycytidine (2000 g, 3.68 mol) was dissolved in anhydrous DMF (6.0 kg) at ambient temperature in a 50 L glass reactor vessel equipped with an air stirrer and argon line. Benzoic anhydride Chem Impex not Aldrich, 874 g, 3.86 mol, 1.05 eq) was added and the reaction was stirred at ambient temperature for 8 h. TLC (CH₂Cl₂-EtOAc; CH₂Cl₂-EtOAc 4:1; R_(f) 0.25) indicated approx. 92% complete reaction. An additional amount of benzoic anhydride (44 g, 0.19 mol) was added. After a total of 18 h, TLC indicated approx. 96% reaction completion. The solution was diluted with EtOAc (20 L), TEA (1020 mL, 7.36 mol, ca 2.0 eq) was added with stirring, and the mixture was extracted with water (15 L, then 2×10 L). The aqueous layer was removed (no back-extraction was needed) and the organic layer was concentrated in 2×20 L rotary evaporator flasks until a foam began to form. The residues were coevaporated with acetonitrile (1.5 L each) and dried (0.1 mm Hg, 25° C., 24 h) to 2520 g of a dense foam. High pressure liquid chromatography (HPLC) revealed a contamination of 6.3% of N4, 3′-O-dibenzoyl product, but very little other impurities.

[0150] THe product was purified by Biotage column chromatography (5 kg Biotage) prepared with 65:35:1 hexanes-EtOAc-TEA (4L). The crude product (800 g), dissolved in CH₂Cl₂ (2 L), was applied to the column. The column was washed with the 65:35:1 solvent mixture (20 kg), then 20:80:1 solvent mixture (10 kg), then 99:1 EtOAc:TEA (17 kg). The fractions containing the product were collected, and any fractions containing the product and impurities were retained to be resubjected to column chromatography. The column was re-equilibrated with the original 65:35:1 solvent mixture (17 kg). A second batch of crude product (840 g) was applied to the column as before. The column was washed with the following solvent gradients: 65:35:1 (9 kg), 55:45:1 (20 kg), 20:80:1 (10 kg), and 99:1 EtOAc:TEA(15 kg). The column was reequilibrated as above, and a third batch of the crude product (850 g) plus impure fractions recycled from the two previous columns (28 g) was purified following the procedure for the second batch. The fractions containing pure product combined and concentrated on a 20L rotary evaporator, co-evaporated with acetontirile (3 L) and dried (0.1 mm Hg, 48 h, 25° C.) to a constant weight of 2023 g (85%) of white foam and 20 g of slightly contaminated product from the third run. HPLC indicated a purity of 99.8% with the balance as the diBenzoyl product.

[0151] [5′-O-(4,4′-Dimethoxytriphenylmethyl)-2′-deoxy-N⁴-benzoyl-5-methylcytidin-3′-O-yl]-2-cyanoethyl-N,N-diisopropylphosphoramidite 5-methyl dC amidite)

[0152] 5′-O-4,4′-Dimethoxytriphenylmethyl)-2′-deoxy-N₄-benzoyl-5-methylcytidine (998 g, 1.5 mol) was dissolved in anhydrous DMF (2 L). The solution was co-evaporated with toluene (300 ml) at 50° C. under reduced pressure, then cooled to room temperature and 2-cyanoethyl tetraisopropylphosphorodiamidite (680 g, 2.26 mol) and tetrazole (52.5 g, 0.75 mol) were added. The mixture was shaken until all tetrazole was dissolved, N-methylimidazole (15 ml) was added and the mixture was left at room temperature for 5 hours. TEA (300 ml) was added, the mixture was diluted with DMF (2.5 L) and water (600 ml), and extracted with heane (3×3 L). The mixture was diluted with water (1.2 L) and extracted with a mixture of toluene (7.5 L) and heane (6 L). The two layers were separated, the upper layer was washed with DMF-water (7:3 v/v, 3×2 L) and water (3×2 L), and the phases were separated. The organic layer was dried (Na₂SO4), filtered and rotary evaporated. The residue was co-evaporated with acetonitrile (2×2 L) under reduced pressure and dried to a constant weight (25° C., 0.1 mm Hg, 40 h) to afford 1250 g an off-white foam solid (96%).

[0153] 2′-Fluoro amidites

[0154] 2′-Fluorodeoxyadenosine amidites

[0155] 2′-fluoro oligonucleotides were synthesized as described previously [Kawasaki, et. al., J. Med. Chem., 1993, 36, 831-841] and U.S. Pat. No. 5,670,633, herein incorporated by reference. The preparation of 2′-fluoropyrimidines containing a 5-methyl substitution are described in U.S. Pat. No. 5,861,493. Briefly, the protected nucleoside N6-benzoyl-2′-deoxy-2′-fluoroadenosine was synthesized utilizing commercially available 9-beta-D-arabinofuranosyladenine as starting material and whereby the 2′-alpha-fluoro atom is introduced by a S_(N)2-displacement of a 2′-beta-triflate group. Thus N6-benzoyl-9-beta-D-arabinofuranosyladenine was selectively protected in moderate yield as the 3′,5′-ditetrahydropyranyl (THP) intermediate. Deprotection of the THP and N6-benzoyl groups was accomplished using standard methodologies to obtain the 5′-dimethoxytrityl-(DMT) and 5′-DMT-3′-phosphoramidite intermediates.

[0156] 2′-Fluorodeoxyguanosine

[0157] The synthesis of 2′-deoxy-2′-fluoroguanosine was accomplished using tetraisopropyldisiloxanyl (TPDS) protected 9-beta-D-arabinofuranosylguanine as starting material, and conversion to the intermediate isobutyryl-arabinofuranosylguanosine. Alternatively, isobutyryl-arabinofuranosylguanosine was prepared as described by Ross et al., Nucleosides & Nucleosides, 16, 1645, 1997). Deprotection of the TPDS group was followed by protection of the hydroxyl group with THP to give isobutyryl di-THP protected arabinofuranosylguanine. Selective O-deacylation and triflation was followed by treatment of the crude product with fluoride, then deprotection of the THP groups. Standard methodologies were used to obtain the 5′-DMT- and 5′-DMT-3′-phosphoramidites.

[0158] 2′-Fluorouridine

[0159] Synthesis of 2′-deoxy-2′-fluorouridine was accomplished by the modification of a literature procedure in which 2,2′-anhydro-1-beta-D-arabinofuranosyluracil was treated with 70% hydrogen fluoride-pyridine. Standard procedures were used to obtain the 5′-DMT and 5′-DMT-3′phosphoramidites.

[0160] 2′-Fluorodeoxycytidine

[0161] 2′-deoxy-2′-fluorocytidine was synthesized via amination of 2′-deoxy-2′-fluorouridine, followed by selective protection to give N4-benzoyl-2′-deoxy-2′-fluorocytidine. Standard procedures were used to obtain the 5′-DMT and 5′-DMT-3′phosphoramidites.

[0162] 2′-O-(2-Methoxyethyl) Modified Amidites

[0163] 2′-O-Methoxyethyl-substituted nucleoside amidites (otherwise known as MOE amidites) are prepared as follows, or alternatively, as per the methods of Martin, P., (Helvetica Chimica Acta, 1995, 78, 486-504).

[0164] Preparation of 2′-O-(2-methoxyethyl)-5-methyluridine Intermediate

[0165] 2,2′-Anhydro-5-methyl-uridine (2000 g, 8.32 mol), tris2-methoxyethyl)borate (2504 g, 10.60 mol), sodium bicarbonate (60 g, 0.70 mol) and anhydrous 2-methoxyethanol (5 L) were combined in a 12 L three necked flask and heated to 130° C. (internal temp) at atmospheric pressure, under an argon atmosphere with stirring for 21 h. TLC indicated a complete reaction. The solvent was removed under reduced pressure until a sticky gum formed (50-85° C. bath temp and 100-11 mm Hg) and the residue was redissolved in water (3 L) and heated to boiling for 30 min in order the hydrolyze the borate esters. The water was removed under reduced pressure until a foam began to form and then the process was repeated. HPLC indicated about 77% product, 15% dimer (5′ of product attached to 2′ of starting material) and unknown derivatives, and the balance was a single unresolved early eluting peak.

[0166] The gum was redissolved in brine (3 L), and the flask was rinsed with additional brine (3 L). The combined aqueous solutions were extracted with chloroform (20 L) in a heavier-than continuous extractor for 70 h. The chloroform layer was concentrated by rotary evaporation in a 20 L flask to a sticky foam (2400 g). This was coevaporated with MeOH (400 mL) and EtOAc (8 L) at 75° C. and 0.65 atm until the foam dissolved at which point the vacuum was lowered to about 0.5 atm. After 2.5 L of distillate was collected a precipitate began to form and the flask was removed from the rotary evaporator and stirred until the suspension reached ambient temperature. EtOAc (2 L) was added and the slurry was filtered on a 25 cm table top Buchner funnel and the product was washed with EtOAc (3×2 L). The bright white solid was air dried in pans for 24 h then further dried in a vacuum oven (50° C., 0.1 mm Hg, 24 h) to afford 1649 g of a white crystalline solid (mp 115.5-116.5° C.).

[0167] The brine layer in the 20 L continuous extractor was further extracted for 72 h with recycled chloroform. The chloroform was concentrated to 120 g of oil and this was combined with the mother liquor from the above filtration (225 g), dissolved in brine (250 mL) and extracted once with chloroform (250 mL). The brine solution was continuously extracted and the product was crystallized as described above to afford an additional 178 g of crystalline product containing about 2% of thymine. The combined yield was 1827 g (69.4%). HPLC indicated about 99.5% purity with the balance being the dimer.

[0168] Preparation of 5′-O-DMT-2′-O-(2-methoxyethyl)-5-methyluridine Penultimate Intermediate

[0169] In a 50 L glass-lined steel reactor, 2′-O-(2-methoxyethyl)-5-methyl-uridine (MOE-T, 1500 g, 4.738 mol), lutidine (1015 g, 9.476 mol) were dissolved in anhydrous acetonitrile (15 L). The solution was stirred rapidly and chilled to −10° C. (internal temperature). Dimethoxytriphenylmethyl chloride (1765.7 g, 5.21 mol) was added as a solid in one portion. The reaction was allowed to warm to −2° C. over 1 h. (Note: The reaction was monitored closely by TLC (EtOAc) to determine when to stop the reaction so as to not generate the undesired bis-DMT substituted side product). The reaction was allowed to warm from −2 to 3° C. over 25 min. then quenched by adding MeOH (300 mL) followed after 10 min by toluene (16 L) and water (16 L). The solution was transferred to a clear 50 L vessel with a bottom outlet, vigorously stirred for 1 minute, and the layers separated. The aqueous layer was removed and the organic layer was washed successively with 10% aqueous citric acid (8 L) and water (12 L). The product was then extracted into the aqueous phase by washing the toluene solution with aqueous sodium hydroxide (0.5N, 16 L and 8 L). The combined aqueous layer was overlayed with toluene (12 L) and solid citric acid (8 moles, 1270 g) was added with vigorous stirring to lower the pH of the aqueous layer to 5.5 and extract the product into the toluene. The organic layer was washed with water (10 L) and TLC of the organic layer indicated a trace of DMT-O-Me, bis DMT and dimer DMT.

[0170] The toluene solution was applied to a silica gel column (6 L sintered glass funnel containing approx. 2 kg of silica gel slurried with toluene (2 L) and TEA(25 mL)) and the fractions were eluted with toluene (12 L) and EtOAc (3×4 L) using vacuum applied to a filter flask placed below the column. The first EtOAc fraction containing both the desired product and impurities were resubjected to column chromatography as above. The clean fractions were combined, rotary evaporated to a foam, coevaporated with acetonitrile (6 L) and dried in a vacuum oven (0.1 mm Hg, 40 h, 40° C.) to afford 2850 g of a white crisp foam. NMR spectroscopy indicated a 0.25 mole % remainder of acetonitrile (calculates to be approx. 47 g) to give a true dry weight of 2803 g (96%). HPLC indicated that the product was 99.41% pure, with the remainder being 0.06 DMT-O-Me, 0.10 unknown, 0.44 bis DMT, and no detectable dimer DMT or 3′-O-DMT.

[0171] Preparation of [5′-O-(4,4′-Dimethoxytriphenylmethyl)-2′-O-(2-methoxyethyl)-5-methyluridin-3′-O-yl]-2-cyanoethyl-N,N-diisopropylphosphoramidite (MOE T amidite)

[0172] 5′-O-(4,4′-Dimethoxytriphenylmethyl)-2′-O-(2-methoxyethyl)-5-methyluridine (1237 g, 2.0 mol) was dissolved in anhydrous DMF (2.5 L). The solution was co-evaporated with toluene (200 ml) at 50° C. under reduced pressure, then cooled to room temperature and 2-cyanoethyl tetraisopropylphosphorodiamidite (900 g, 3.0 mol) and tetrazole (70 g, 1.0 mol) were added. The mixture was shaken until all tetrazole was dissolved, N-methylimidazole (20 ml) was added and the solution was left at room temperature for 5 hours. TEA (300 ml) was added, the mixture was diluted with DMF (3.5 L) and water (600 ml) and extracted with heane (3×3L). The mixture was diluted with water (1.6 L) and extracted with the mixture of toluene (12 L) and hexanes (9 L). The upper layer was washed with DMF-water (7:3 v/v, 3×3 L) and water (3×3 L). The organic layer was dried (Na₂SO₄), filtered and evaporated. The residue was co-evaporated with acetonitrile (2×2 L) under reduced pressure and dried in a vacuum oven (25° C., 0.1 mm Hg, 40 h) to afford 1526 g of an off-white foamy solid (95%).

[0173] Preparation of 5′-O-Dimethoxytrityl-2′-O-(2-methoxyethyl)-5-methylcytidine Intermediate

[0174] To a 50 L Schott glass-lined steel reactor equipped with an electric stirrer, reagent addition pump (connected to an addition funnel), heating/cooling system, internal thermometer and argon gas line was added 5′-O-dimethoxytrityl-2′-O-(2-methoxyethyl)-5-methyl-uridine (2.616 kg, 4.23 mol, purified by base extraction only and no scrub column), anhydrous acetonitrile (20 L), and TEA (9.5 L, 67.7 mol, 16 eq). The mixture was chilled with stirring to −10° C. internal temperature (external −20° C.). Trimethylsilylchloride (1.60 L, 12.7 mol, 3.0 eq) was added over 30 min. while maintaining the internal temperature below −5° C., followed by a wash of anhydrous acetonitrile (1 L). (Note: the reaction is mildly exothermic and copious hydrochloric acid fumes form over the course of the addition). The reaction was allowed to warm to 0° C. and the reaction progress was confirmed by TLC (EtOAc, R_(f) 0.68 and 0.87 for starting material and silyl product, respectively). Upon completion, triazole (2.34 kg, 33.8 mol, 8.0 eq) was added the reaction was cooled to −20° C. internal temperature (external −30° C.). Phosphorous oxychloride (793 mL, 8.51 mol. 2.01 eq) was added slowly over 60 min so as to maintain the temperature between −20° C. and −10° C. (note: strongly exothermic), followed by a wash of anhydrous acetonitrile (1 L). The reaction was warmed to 0° C. and stirred for 1 h, at which point it was an off-white thick suspension. TLC indicated a complete conversion to the triazole product (EtOAc, R_(f) 0.87 to 0.75 with the product spot glowing in long wavelength UV light). The reaction was cooled to −15° C. and water (5 L) was slowly added at a rate to maintain the temperature below +10° C. in order to quench the reaction and to form a homogenous solution. (Caution: this reaction is initially very strongly exothermic). Approximately one-half of the reaction volume (22 L) was transferred by air pump to another vessel, diluted with EtOAc (12 L) and extracted with water (2×8 L). The second half of the reaction was treated in the same way. The combined aqueous layers were back-extracted with EtOAc (8 L) The organic layers were combined and concentrated in a 20 L rotary evaporator to an oily foam. The foam was coevaporated with anhydrous acetonitrile (4 L) to remove EtOAc. (note: dioxane may be used instead of anhydrous acetonitrile if dried to a hard foam). The residue was dissolved in dioxane (2 L) and concentrated ammonium hydroxide (750 mL) was added. A homogenous solution formed in a few minutes and the reaction was allowed to stand overnight

[0175] TLC indicated a complete reaction (CH₂Cl₂-acetone-MeOH, 20:5:3, R_(f) 0.51). The reaction solution was concentrated on a rotary evaporator to a dense foam and slowly redissolved in warm CH₂Cl₂ (4 L, 40° C.) and transferred to a 20 L glass extraction vessel equipped with a air-powered stirrer. The organic layer was extracted with water (2×6 L) to remove the triazole by-product. (Note: In the first extraction an emulsion formed which took about 2 h to resolve). The water layer was back-extracted with CH₂Cl₂ (2×2 L), which in turn was washed with water (3 L). The combined organic layer was concentrated in 2×20 L flasks to a gum and then recrystallized from EtOAc seeded with crystalline product. After sitting overnight, the first crop was collected on a 25 cm Coors Buchner funnel and washed repeatedly with EtOAc until a white free-flowing powder was left (about 3×3 L). The filtrate was concentrated to an oil recrystallized from EtOAc, and collected as above. The solid was air-dried in pans for 48 h, then further dried in a vacuum oven (50° C., 0.1 mm Hg, 17 h) to afford 2248 g of a bright white, dense solid (86%). An HPLC analysis indicated both crops to be 99.4% pure and NMR spectroscopy indicated only a faint trace of EtOAc remained.

[0176] Preparation of 5′-O-dimethoxytrityl-2′-O-(2-methoxyethyl)-N4-benzoyl-5-methyl-cytidine Penultimate Intermediate:

[0177] Crystalline 5′-O-dimethoxytrityl-2′-O-(2-methoxyethyl)-5-methyl-cytidine (1000 g, 1.62 mol) was suspended in anhydrous DMF (3 kg) at ambient temperature and stirred under an Ar atmosphere. Benzoic anhydride (439.3 g, 1.94 mol) was added in one portion. The solution clarified after 5 hours and was stirred for 16 h. HPLC indicated 0.45% starting material remained (as well as 0.32% N4, 3′-O-bis Benzoyl). An additional amount of benzoic anhydride (6.0 g, 0.0265 mol) was added and after 17 h, HPLC indicated no starting material was present. TEA (450 mL, 3.24 mol) and toluene (6 L) were added with stirring for 1 minute. The solution was washed with water (4×4 L), and brine (2×4 L). The organic layer was partially evaporated on a 20 L rotary evaporator to remove 4 L of toluene and traces of water. HPLC indicated that the bis benzoyl side product was present as a 6% impurity. The residue was diluted with toluene (7 L) and anhydrous DMSO (200 mL, 2.82 mol) and sodium hydride (60% in oil, 70 g, 1.75 mol) was added in one portion with stirring at ambient temperature over 1 h. The reaction was quenched by slowly adding then washing with aqueous citric acid (10%, 100 mL over 10 min, then 2×4 L), followed by aqueous sodium bicarbonate (2%, 2 L), water (2×4 L) and brine (4 L). The organic layer was concentrated on a 20 L rotary evaporator to about 2 L total volume. The residue was purified by silica gel column chromatography (6 L Buchner funnel containing 1.5 kg of silica gel wetted with a solution of EtOAc-hexanes-TEA(70:29:1)). The product was eluted with the same solvent (30 L) followed by straight EtOAc (6 L). The fractions containing the product were combined, concentrated on a rotary evaporator to a foam and then dried in a vacuum oven (50° C., 0.2 mm Hg, 8 h) to afford 1155 g of a crisp, white foam (98%). HPLC indicated a purity of >99.7%.

[0178] Preparation of [5′-O-(4,4′-Dimethoxytriphenylmethyl)-2′-O-(2-methoxyethyl) -N⁴-benzoyl-5-methylcytidin-3′-O-yl]-2-cyanoethyl-N,N-diisopropylphosphoramidite (MOE 5-Me-C amidite)

[0179] 5′-O-(4,4′-Dimethoxytriphenylmethyl)-2′-O-(2-methoxyethyl)-N⁴-benzoyl-5-methylcytidine (1082 g, 1.5 mol) was dissolved in anhydrous DMF (2 L) and co-evaporated with toluene (300 ml) at 50° C. under reduced pressure. The mixture was cooled to room temperature and 2-cyanoethyl tetraisopropylphosphorodiamidite (680 g, 2.26 mol) and tetrazole (52.5 g, 0.75 mol) were added. The mixture was shaken until all tetrazole was dissolved, N-methylimidazole (30 ml) was added, and the mixture was left at room temperature for 5 hours. TEA (300 ml) was added, the mixture was diluted with DMF (1 L) and water (400 ml) and extracted with heane (3×3 L). The mixture was diluted with water (1.2 L) and extracted with a mixture of toluene (9 L) and hexanes (6 L). The two layers were separated and the upper layer was washed with DMF-water (60:40 v/v, 3×3 L) and water (3×2 L). The organic layer was dried (Na₂SO₄), filtered and evaporated. The residue was co-evaporated with acetonitrile (2×2 L) under reduced pressure and dried in a vacuum oven (25° C., 0.1 mm Hg, 40 h) to afford 1336 g of an off-white foam (97%).

[0180] Preparation of [5′-O-(4,4′-Dimethoxytriphenylmethyl)-2′-O-(2-methoxyethyl)-N⁶-benzoyladenosin-3′-O-yl]-2-cyanoethyl-N,N-diisopropylphosphoramidite (MOE A amdite)

[0181] 5′-O-(4,4′-Dimethoxytriphenylmethyl)-2′-O-(2-methoxyethyl)-N⁶-benzoyladenosine (purchased from Reliable Biopharmaceutical, St. Lois, Mo.), 1098 g, 1.5 mol) was dissolved in anhydrous DMF (3 L) and co-evaporated with toluene (300 ml) at 50° C. The mixture was cooled to room temperature and 2-cyanoethyl tetraisopropylphosphorodiamidite (680 g, 2.26 mol) and tetrazole (78.8 g, 1.24 mol) were added. The mixture was shaken until all tetrazole was dissolved, N-methylimidazole (30 ml) was added, and mixture was left at room temperature for 5 hours. TEA (300 ml) was added, the mixture was diluted with DMF (1 L) and water (400 ml) and extracted with hexanes (3×3 L). The mixture was diluted with water (1.4 L) and extracted with the mixture of toluene (9 L) and hexanes (6 L). The two layers were separated and the upper layer was washed with DMF-water (60:40, v/v, 3×3 L) and water (3×2 L). The organic layer was dried (Na₂SO₄) filtered and evaporated to a sticky foam. The residue was co-evaporated with acetonitrile (2.5 L) under reduced pressure and dried in a vacuum oven (25 ° C., 0.1 mm Hg, 40 h) to afford 1350 g of an off-white foam solid (96%).

[0182] Prepartion of [5′-O-(4,4′-Dimethoxytriphenylmethyl)-2′-O- (2-methoxyethyl)-N⁴-isobutyrylguanosin-3′-O-yl]-2-cyanoethyl-N,N-diisopropylphosphoramidite (MOE G amidite)

[0183] 5′-O-(4,4′-Dimethoxytriphenylmethyl)-2′-O-(2-methoxyethyl)-N⁴-isobutyrlguanosine (purchased from Reliable Biopharmaceutical, St. Louis, Mo., 1426 g, 2.0 mol) was dissolved in anhydrous DMF (2 L). The solution was co-evaporated with toluene (200 ml) at 50° C., cooled to room temperature and 2-cyanoethyl tetraisopropylphosphorodiamidite (900 g, 3.0 mol) and tetrazole (68 g, 0.97 mol) were added. The mixture was shaken until all tetrazole was dissolved, N-methylimidazole (30 ml) was added, and the mixture was left at room temperature for 5 hours. TEA (300 ml) was added, the mixture was diluted with DMF (2 L) and water (600 ml) and extracted with hexanes (3×3 L). The mixture was diluted with water (2 L) and extracted with a mixture of toluene (10 L) and hexanes (5 L). The two layers were separated and the upper layer was washed with DMF-water (60:40, v/v, 3×3 L). EtOAc (4 L) was added and the solution was washed with water (3×4 L). The organic layer was dried (Na₂SO₄), filtered and evaporated to approx. 4 kg. Hexane (4 L) was added, the mixture was shaken for 10 min, and the supernatant liquid was decanted. The residue was co-evaporated with acetonitrile (2×2 L) under reduced pressure and dried in a vacuum oven (25° C., 0.1 mm Hg, 40 h) to afford 1660 g of an off-white foamy solid (91%).

[0184] 2′-O-(Aminooxyethyl) nucleoside amidites and 2′-O-(dimethylaminooxyethyl) nucleoside amidites

[0185] 2′-(Dimethylaminooxyethoxy) nucleoside amidites

[0186] 2′-(Dimethylaminooxyethoxy) nucleoside amidites (also known in the art as 2′-O-(dimethylaminooxyethyl) nucleoside amidites) are prepared as described in the following paragraphs. Adenosine, cytidine and guanosine nucleoside amidites are prepared similarly to the thymidine (5-methyluridine) except the exocyclic amines are protected with a benzoyl moiety in the case of adenosine and cytidine and with isobutyryl in the case of guanosine.

[0187] 5′-O-tert-Butyldiphenylsilyl-O²-2′-anhydro-5-methyluridine

[0188] O²-2′-anhydro-5-methyluridine (Pro. Bio. Sint., Varese, Italy, 100.0 g, 0.416 mmol), dimethylaminopyridine (0.66 g, 0.013eq, 0.0054 mmol) were dissolved in dry pyridine (500 ml) at ambient temperature under an argon atmosphere and with mechanical stirring. tert-Butyldiphenylchlorosilane (125.8 g, 119.0 mL, 1.1 eq, 0.458 mmol) was added in one portion. The reaction was stirred for 16 h at ambient temperature. TLC (R_(f) 0.22, EtOAc) indicated a complete reaction. The solution was concentrated under reduced pressure to a thick oil. This was partitioned between CH₂Cl₂ (1 L) and saturated sodium bicarbonate (2×1 L) and brine (1 L). The organic layer was dried over sodium sulfate, filtered, and concentrated under reduced pressure to a thick oil. The oil was dissolved in a 1:1 mixture of EtOAc and ethyl ether (600 mL) and cooling the solution to −10° C. afforded a white crystalline solid which was collected by filtration, washed with ethyl ether (3×2 00 mL) and dried (40° C., 1 mm Hg, 24 h) to afford 149 g of white solid (74.8%). TLC and NMR spectroscopy were consistent with pure product.

[0189] 5′-O-tert-Butyldiphenylsilyl-2′-O-(2-hydroxyethyl)-5-methyluridine

[0190] In the fume hood, ethylene glycol (350 mL, excess) was added cautiously with manual stirring to a 2 L stainless steel pressure reactor containing borane in tetrahydrofuran (1.0 M, 2.0 eq, 622 mL). (Caution : evolves hydrogen gas). 5′-O-tert-Butyldiphenylsilyl-O²-2′-anhydro-5-methyluridine (149 g, 0.311 mol) and sodium bicarbonate (0.074 g, 0.003 eq) were added with manual stirring. The reactor was sealed and heated in an oil bath until an internal temperature of 160° C. was reached and then maintained for 16 h (pressure <100 psig). The reaction vessel was cooled to ambient temperature and opened. TLC (EtOAc, R_(f) 0.67 for desired product and R_(f) 0.82 for ara-T side product) indicated about 70% conversion to the product. The solution was concentrated under reduced pressure (10 to 1 mm Hg) in a warm water bath (40-100° C.) with the more extreme conditions used to remove the ethylene glycol. (Alternatively, once the THF has evaporated the solution can be diluted with water and the product extracted into EtOAc). The residue was purified by column chromatography (2 kg silica gel, EtOAc-hexanes gradient 1:1 to 4:1). The appropriate fractions were combined, evaporated and dried to afford 84 g of a white crisp foam (50%), contaminated starting material (17.4 g, 12% recovery) and pure reusable starting material (20 g, 13% recovery). TLC and NMR spectroscopy were consistent with 99% pure product.

[0191] 2′-O-([2-phthalimidoxy)ethyl]-5′-t-butyldiphenylsilyl-5-methyluridine

[0192] 5′-O-tert-Butyldiphenylsilyl-2′-O-(2-hydroxyethyl)-5-methyluridine (20 g, 36.98 mmol) was mixed with triphenylphosphine (11.63 g, 44.36 mmol) and N-hydroxyphthalimide (7.24 g, 44.36 mmol) and dried over P₂O₅ under high vacuum for two days at 40° C. The reaction mixture was flushed with argon and dissolved in dry THF (369.8 mL, Aldrich, sure seal bottle). Diethyl-azodicarboxylate (6.98 mL, 44.36 mmol) was added dropwise to the reaction mixture with the rate of addition maintained such that the resulting deep red coloration is just discharged before adding the next drop. The reaction mixture was stirred for 4 hrs., after which time TLC (EtOAc:heane, 60:40) indicated that the reaction was complete. The solvent was evaporated in vacuuo and the residue purified by flash column chromatography (eluted with 60:40 EtOAc:heane), to yield 2′-O-([2-phthalimidoxy)ethyl]-5′-t-butyldiphenylsilyl-5-methyluridine as white foam (21.819 g, 86%) upon rotary evaporation.

[0193] 5′-O-tert-butyldiphenylsilyl-2′-O-[(2-formadoximinooxy)ethyl]-5-methyluridine

[0194] 2′-O-([2-phthalimidoxy)ethyl]-5′-t-butyldiphenylsilyl-5-methyluridine (3.1 g, 4.5 mmol) was dissolved in dry CH₂Cl₂ (4.5 mL) and methylhydrazine (300 mL, 4.64 mmol) was added dropwise at −10° C. to 0° C. After 1 h the mixture was filtered, the filtrate washed with ice cold CH₂Cl₂, and the combined organic phase was washed with water and brine and dried (anhydrous Na₂SO₄). The solution was filtered and evaporated to afford 2′-O-(aminooxyethyl) thymidine, which was then dissolved in MeOH (67.5 mL). Formaldehyde (20% aqueous solution, w/w, 1.1 eq.) was added and the resulting mixture was stirred for 1 h. The solvent was removed under vacuum and the residue was purified by column chromatography to yield 5′-O-tert-butyldiphenylsilyl-2′-O-[(2-formadoximinooxy) ethyl]-5-methyluridine as white foam (1.95 g, 78%) upon rotary evaporation.

[0195] 5′-O-tert-Butyldiphenylsilyl-2′-O-[N,N dimethylaminooxyethyl]-5-methyluridine

[0196] 5′-O-tert-butyldiphenylsilyl-2′-O-[(2-formadoximinooxy)ethyl]-5-methyluridine (1.77 g, 3.12 mmol) was dissolved in a solution of 1M pyridinium p-toluenesulfonate (PPTS) in dry MeOH (30.6 mL) and cooled to 10° C. under inert atmosphere. Sodium cyanoborohydride (0.39 g, 6.13 mmol) was added and the reaction mixture was stirred. After 10 minutes the reaction was warmed to room temperature and stirred for 2 h. while the progress of the reaction was monitored by TLC (5% MeOH in CH₂Cl₂). Aqueous NaHCO₃ solution (5%, 10 mL) was added and the product was extracted with EtOAc (2×20 mL). The organic phase was dried over anhydrous Na₂SO₄, filtered, and evaporated to dryness. This entire procedure was repeated with the resulting residue, with the exception that formaldehyde (20% w/w, 30 mL, 3.37 mol) was added upon dissolution of the residue in the PPTS/MeOH solution. After the extraction and evaporation, the residue was purified by flash column chromatography and (eluted with 5% MeOH in CH₂Cl₂) to afford 5′-O-tert-butyldiphenylsilyl-2′-O-[N,N-dimethylaminooxyethyl]-5-methyluridine as a white foam (14.6 g, 80%) upon rotary evaporation.

[0197] 2′-O-(dimethylaminooxyethyl)-5-methyluridine

[0198] Triethylamine trihydrofluoride (3.91 mL, 24.0 mmol) was dissolved in dry THF and TEA (1.67 mL, 12 mmol, dry, stored over KOH) and added to ′-O-tert-butyldiphenylsilyl-2′-O-[N,N-dimethylaminooxyethyl]-5-methyluridine (1.40 g, 2.4 mmol). The reaction was stirred at room temperature for 24 hrs and monitored by TLC (5% MeOH in CH₂Cl₂). The solvent was removed under vacuum and the residue purified by flash column chromatography (eluted with 10% MeOH in CH₂Cl₂) to afford 2′-O-(dimethylaminooxyethyl)-5-methyluridine (766 mg, 92.5%) upon rotary evaporation of the solvent.

[0199] 5′-O-DMT-2′-O-(dimethylaminooxyethyl)-5-methyluridine

[0200] 2′-O-(dimethylaminooxyethyl)-5-methyluridine (750 mg, 2.17 mmol) was dried over P₂O₅ under high vacuum overnight at 40° C., co-evaporated with anhydrous pyridine (20 mL), and dissolved in pyridine (11 mL) under argon atmosphere. 4-dimethylaminopyridine (26.5 mg, 2.60 mmol) and 4,4′-dimethoxytrityl chloride (880 mg, 2.60 mmol) were added to the pyridine solution and the reaction mixture was stirred at room temperature until all of the starting material had reacted. Pyridine was removed under vacuum and the residue was purified by column chromatography (eluted with 10% MeOH in CH₂Cl₂ containing a few drops of pyridine) to yield 5′-O-DMT-2′-O-(dimethylamino-oxyethyl)-5-methyluridine (1.13 g, 80%) upon rotary evaporation.

[0201] 5′-O-DMT-2′-O-(2-N,N-dimethylaminooxyethyl)-5-methyluridine-3′-[(2-cyanoethyl)-N,N-diisopropylphosphoramidite]

[0202] 5′-O-DMT-2′-O-(dimethylaminooxyethyl)-5-methyluridine (1.08 g, 1.67 mmol) was co-evaporated with toluene (20 mL), N,N-diisopropylamine tetrazonide (0.29 g, 1.67 mmol) was added and the mixture was dried over P₂O₅ under high vacuum overnight at 40° C. This was dissolved in anhydrous acetonitrile (8.4 mL) and 2-cyanoethyl-N,N,N¹,N¹-tetraisopropylphosphoramidite (2.12 mL, 6.08 mmol) was added. The reaction mixture was stirred at ambient temperature for 4 h under inert atmosphere. The progress of the reaction was monitored by TLC (hexane:EtOAc 1:1). The solvent was evaporated, then the residue was dissolved in EtOAc (70 mL) and washed with 5% aqueous NaHCO₃ (40 mL). The EtOAc layer was dried over anhydrous Na₂SO₄, filtered, and concentrated. The residue obtained was purified by column chromatography (EtOAc as eluent) to afford 5′-O-DMT-2′-O-(2-N,N-dimethylaminooxyethyl)-5-methyluridine-3′-[(2-cyanoethyl)-N,N-diisopropylphosphoramidite] as a foam (1.04 g, 74.9%) upon rotary evaporation.

[0203] 2′-(Aminooxyethoxy) Nucleoside Amidites

[0204] 2′-(Aminooxyethoxy) nucleoside amidites (also known in the art as 2′-O-(aminooxyethyl) nucleoside amidites) are prepared as described in the following paragraphs. Adenosine, cytidine and thymidine nucleoside amidites are prepared similarly.

[0205] N2-isobutyryl-6-O-diphenylcarbamoyl-2′-O-(2-ethylacetyl)-5′-O-(4,4′-dimethoxytrityl)guanosine-3′-[(2-cyanoethyl)-N,N-diisopropylphosphoramidite]

[0206] The 2′-O-aminooxyethyl guanosine analog may be obtained by selective 2′-O-alkylation of diaminopurine riboside. Multigram quantities of diaminopurine riboside may be purchased from Schering AG (Berlin) to provide 2′-O-(2-ethylacetyl) diaminopurine riboside along with a minor amount of the 3′-O-isomer. 2′-O-(2-ethylacetyl) diaminopurine riboside may be resolved and converted to 2′-O-(2-ethylacetyl)guanosine by treatment with adenosine deaminase. (McGee, D. P. C., Cook, P. D., Guinosso, C. J., WO 94/02501 A1 940203.) Standard protection procedures should afford 2′-O-(2-ethylacetyl)-5′-O-(4,4′-dimethoxytrityl)guanosine and 2-N-isobutyryl-6-O-diphenylcarbamoyl-2′-O-(2-ethylacetyl)-5′-O-(4,4′-dimethoxytrityl)guanosine which may be reduced to provide 2-N-isobutyryl-6-O-diphenylcarbamoyl-2′-O-(2-hydroxyethyl)-5′-O-(4,4′-dimethoxytrityl)guanosine. As before the hydroxyl group may be displaced by N-hydroxyphthalimide via a Mitsunobu reaction, and the protected nucleoside may be phosphitylated as usual to yield 2-N-isobutyryl-6-O-diphenylcarbamoyl-2′-O-([2-phthalmidoxy]ethyl)-5′-O-(4,4′-dimethoxytrityl)guanosine-3′-[(2-cyanoethyl)-N,N-diisopropylphosphoramidite].

[0207] 2′-dimethylaminoethoxyethoxy (2′-DMAEOE) Nucleoside Amidites

[0208] 2′-dimethylaminoethoxyethoxy nucleoside amidites (also known in the art as 2′-O-dimethylaminoethoxyethyl, i.e., 2′-O—CH₂—O—CH₂—N(CH₂)₂, or 2′-DMAEOE nucleoside amidites) are prepared as follows. Other nucleoside amidites are prepared similarly.

[0209] 2′-O-[2(2-N,N-dimethylaminoethoxy)ethyl]-5-methyl uridine

[0210] 2[2-(Dimethylamino)ethoxy]ethanol (Aldrich, 6.66 g, 50 mmol) was slowly added to a solution of borane in tetra-hydrofuran (1 M, 10 mL, 10 mmol) with stirring in a 100 mL bomb. (Caution: Hydrogen gas evolves as the solid dissolves). O²-,2′-anhydro-5-methyluridine (1.2 g, 5 mmol), and sodium bicarbonate (2.5 mg) were added and the bomb was sealed, placed in an oil bath and heated to 155° C. for 26 h. then cooled to room temperature. The crude solution was concentrated, the residue was diluted with water (200 mL) and extracted with hexanes (200 mL). The product was extracted from the aqueous layer with EtOAc (3×200 mL) and the combined organic layers were washed once with water, dried over anhydrous sodium sulfate, filtered and concentrated. The residue was purified by silica gel column chromatography (eluted with 5:100:2 MeOH/CH₂Cl₂/TEA) as the eluent. The appropriate fractions were combined and evaporated to afford the product as a white solid. 5′-O-dimethoxytrityl-2′-O-[2(2-N,N-dimethylaminoethoxy) ethyl)]-5-methyl Uridine

[0211] To 0.5 g (1.3 mmol) of 2′-O-[2(2-N,N-dimethylamino-ethoxy)ethyl)]-5-methyl uridine in anhydrous pyridine (8 mL), was added TEA (0.36 mL) and dimethoxytrityl chloride (DMT-Cl, 0.87 g, 2 eq.) and the reaction was stirred for 1 h. The reaction mixture was poured into water (200 mL) and extracted with CH₂Cl₂ (2×200 mL). The combined CH₂Cl₂ layers were washed with saturated NaHCO₃ solution, followed by saturated NaCl solution, dried over anhydrous sodium sulfate, filtered and evaporated. The residue was purified by silica gel column chromatography (eluted with 5:100:1 MeOH/CH₂Cl₂/TEA) to afford the product.

[0212] 5′-O-Dimethoxytrityl-2′-O-[2(2-N,N-dimethylaminoethoxy)-ethyl)]-5-methyl uridine-3′-O-(cyanoethyl-N,N-diisopropyl)phosphoramidite

[0213] Diisopropylaminotetrazolide (0.6 g) and 2-cyanoethoxy-N,N-diisopropyl phosphoramidite (1.1 mL, 2 eq.) were added to a solution of 5′-O-dimethoxytrityl-2′-O-[2(2-N,N-dimethylaminoethoxy)ethyl)]-5-methyluridine (2.17 g, 3 mmol) dissolved in CH₂Cl₂ (20 mL) under an atmosphere of argon. The reaction mixture was stirred overnight and the solvent evaporated. The resulting residue was purified by silica gel column chromatography with EtOAc as the eluent to afford the title compound.

Example 2

[0214] Oligonucleotide Synthesis

[0215] Unsubstituted and substituted phosphodiester (P=O) oligonucleotides are synthesized on an automated DNA synthesizer (Applied Biosystems model 394) using standard phosphoramidite chemistry with oxidation by iodine.

[0216] Phosphorothioates (P═S) are synthesized similar to phosphodiester oligonucleotides with the following exceptions: thiation was effected by utilizing a 10% w/v solution of 3H-1,2-benzodithiole-3-one 1,1-dioxide in acetonitrile for the oxidation of the phosphite linkages. The thiation reaction step time was increased to 180 sec and preceded by the normal capping step. After cleavage from the CPG column and deblocking in concentrated ammonium hydroxide at 55° C. (12-16 hr), the oligonucleotides were recovered by precipitating with >3 volumes of ethanol from a 1 M NH₄OAc solution. Phosphinate oligonucleotides are prepared as described in U.S. Pat. No. 5,508,270, herein incorporated by reference.

[0217] Alkyl phosphonate oligonucleotides are prepared as described in U.S. Pat. No. 4,469,863, herein incorporated by reference. 3′-Deoxy-3′-methylene phosphonate oligonucleotides are prepared as described in U.S. Pat. Nos. 5,610,289 or 5,625,050, herein incorporated by reference.

[0218] Phosphoramidite oligonucleotides are prepared as described in U.S. Pat. No. 5,256,775 or U.S. Pat. No. 5,366,878, herein incorporated by reference.

[0219] Alkylphosphonothioate oligonucleotides are prepared as described in published PCT applications PCT/US94/00902 and PCT/US93/06976 (published as WO 94/17093 and WO 94/02499, respectively), herein incorporated by reference.

[0220] 3′-Deoxy-3′-amino phosphoramidate oligonucleotides are prepared as described in U.S. Pat. No. 5,476,925, herein incorporated by reference.

[0221] Phosphotriester oligonucleotides are prepared as described in U.S. Pat. No. 5,023,243, herein incorporated by reference.

[0222] Borano phosphate oligonucleotides are prepared as described in U.S. Pat. Nos. 5,130,302 and 5,177,198, both herein incorporated by reference.

Example 3

[0223] Oligonucleoside Synthesis

[0224] Methylenemethylimino linked oligonucleosides, also identified as MMI linked oligonucleosides, methylenedimethyl-hydrazo linked oligonucleosides, also identified as MDH linked oligonucleosides, and methylenecarbonylamino linked oligonucleosides, also identified as amide-3 linked oligonucleosides, and methyleneaminocarbonyl linked oligo-nucleosides, also identified as amide-4 linked oligonucleo-sides, as well as mixed backbone compounds having, for instance, alternating MMI and P═O or P═S linkages are prepared as described in U.S. Pat. Nos. 5,378,825, 5,386,023, 5,489,677, 5,602,240 and 5,610,289, all of which are herein incorporated by reference.

[0225] Formacetal and thioformacetal linked oligonucleosides are prepared as described in U.S. Pat. Nos. 5,264,562 and 5,264,564, herein incorporated by reference.

[0226] Ethylene oxide linked oligonucleosides are prepared as described in U.S. Pat. No. 5,223,618, herein incorporated by reference.

Example 4

[0227] PNA Synthesis

[0228] Peptide nucleic acids (PNAs) are prepared in accordance with any of the various procedures referred to in Peptide Nucleic Acids (PNA): Synthesis, Properties and Potential Applications, Bioorganic & Medicinal Chemistry, 1996, 4, 5-23. They may also be prepared in accordance with U.S. Pat. Nos. 5,539,082, 5,700,922, and 5,719,262, herein incorporated by reference.

Example 5

[0229] Synthesis of Chimeric Oligonucleotides

[0230] Chimeric oligonucleotides, oligonucleosides or mixed oligonucleotides/oligonucleosides of the invention can be of several different types. These include a first type wherein the “gap” segment of linked nucleosides is positioned between 5′ and 3′ “wing” segments of linked nucleosides and a second “open end” type wherein the “gap” segment is located at either the 3′ or the 5′ terminus of the oligomeric compound. Oligonucleotides of the first type are also known in the art as “gapmers” or gapped oligonucleotides. Oligonucleotides of the second type are also known in the art as “hemimers” or “wingmers”.

[0231] [2′-O-Me]--[2′-deoxy]--[2′-O-Me] Chimeric Phosphorothioate Oligonucleotides

[0232] Chimeric oligonucleotides having 2′-O-alkyl phosphorothioate and 2′-deoxy phosphorothioate oligonucleotide segments are synthesized using an Applied Biosystems automated DNA synthesizer Model 394, as above. Oligonucleotides are synthesized using the automated synthesizer and 2′-deoxy-5′-dimethoxytrityl-3′-O-phosphor-amidite for the DNA portion and 5′-dimethoxytrityl-2′-O-methyl-3′-O-phosphoramidite for 5′ and 3′ wings. The standard synthesis cycle is modified by incorporating coupling steps with increased reaction times for the 5′-dimethoxytrityl-2′-O-methyl-3′-O-phosphoramidite. The fully protected oligonucleotide is cleaved from the support and deprotected in concentrated ammonia (NH₄OH) for 12-16 hr at 55° C. The deprotected oligo is then recovered by an appropriate method (precipitation, column chromatography, volume reduced in vacuo and analyzed spetrophotometrically for yield and for purity by capillary electrophoresis and by mass spectrometry.

[0233] [2′-O-(2-Methoxyethyl)]--[2′-deoxy]--[2′-O-(Methoxyethyl)] Chimeric Phosphorothioate Oligonucleotides

[0234] [2′-O-(2-methoxyethyl)]--[2′-deoxy]--[-2′-O-(methoxyethyl)] chimeric phosphorothioate oligonucleotides were prepared as per the procedure above for the 2′-O-methyl chimeric oligonucleotide, with the substitution of 2′-O-(methoxyethyl) amidites for the 2′-O-methyl amidites.

[0235] [2′-O-(2-Methoxyethyl)Phosphodiester]--[2′-deoxy Phosphorothioate]--[2′-O-(2-Methoxyethyl) Phosphodiester] Chimeric Oligonucleotides

[0236] [2′-O-(2-methoxyethyl phosphodiester]--[2′-deoxy phosphorothioate]--[2′-O-(methoxyethyl) phosphodiester] chimeric oligonucleotides are prepared as per the above procedure for the 2′-O-methyl chimeric oligonucleotide with the substitution of 2′-O-(methoxyethyl) amidites for the 2′-O-methyl amidites, oxidation with iodine to generate the phosphodiester internucleotide linkages within the wing portions of the chimeric structures and sulfurization utilizing 3,H-1,2 benzodithiole-3-one 1,1 dioxide (Beaucage Reagent) to generate the phosphorothioate internucleotide linkages for the center gap.

[0237] Other chimeric oligonucleotides, chimeric oligonucleosides and mixed chimeric oligonucleotides/oligonucleosides are synthesized according to U.S. Pat. No. 5,623,065, herein incorporated by reference.

Example 6

[0238] Oligonucleotide Isolation

[0239] After cleavage from the controlled pore glass solid support and deblocking in concentrated ammonium hydroxide at 55° C. for 12-16 hours, the oligonucleotides or oligonucleosides are recovered by precipitation out of 1 M NH₄OAc with >3 volumes of ethanol. Synthesized oligonucleotides were analyzed by electrospray mass spectroscopy (molecular weight determination) and by capillary gel electrophoresis and judged to be at least 70% full length material. The relative amounts of phosphorothioate and phosphodiester linkages obtained in the synthesis was determined by the ratio of correct molecular weight relative to the −16 amu product (+/−32 +/−48). For some studies oligonucleotides were purified by HPLC, as described by Chiang et al., J. Biol. Chem. 1991, 266, 18162-18171. Results obtained with HPLC-purified material were similar to those obtained with non-HPLC purified material.

Example 7

[0240] Oligonucleotide Synthesis—96 Well Plate Format

[0241] Oligonucleotides were synthesized via solid phase P(III) phosphoramidite chemistry on an automated synthesizer capable of assembling 96 sequences simultaneously in a 96-well format. Phosphodiester internucleotide linkages were afforded by oxidation with aqueous iodine. Phosphorothioate internucleotide linkages were generated by sulfurization utilizing 3H-1,2 benzodithiole-3-one 1,1 dioxide (Beaucage Reagent) in anhydrous acetonitrile. Standard base-protected beta-cyanoethyl-diiso-propyl phosphoramidites were purchased from commercial vendors (e.g. PE-Applied Biosystems, Foster City, Calif., or Pharmacia, Piscataway, N.J.). Non-standard nucleosides are synthesized as per standard or patented methods. They are utilized as base protected beta-cyanoethyldiisopropyl phosphoramidites.

[0242] Oligonucleotides were cleaved from support and deprotected with concentrated NH₄OH at elevated temperature (55-60° C.) for 12-16 hours and the released product then dried in vacuo. The dried product was then re-suspended in sterile water to afford a master plate from which all analytical and test plate samples are then diluted utilizing robotic pipettors.

Example 8

[0243] Oligonucleotide Analysis—96-Well Plate Format

[0244] The concentration of oligonucleotide in each well was assessed by dilution of samples and UV absorption spectroscopy. The full-length integrity of the individual products was evaluated by capillary electrophoresis (CE) in either the 96-well format (Beckman P/ACE™ MDQ) or, for individually prepared samples, on a commercial CE apparatus (e.g., Beckman P/ACE™ 5000, ABI 270). Base and backbone composition was confirmed by mass analysis of the compounds utilizing electrospray-mass spectroscopy. All assay test plates were diluted from the master plate using single and multi-channel robotic pipettors. Plates were judged to be acceptable if at least 85% of the compounds on the plate were at least 85% full length.

Example 9

[0245] Cell Culture and Oligonucleotide Treatment

[0246] The effect of antisense compounds on target nucleic acid expression can be tested in any of a variety of cell types provided that the target nucleic acid is present at measurable levels. This can be routinely determined using, for example, PCR or Northern blot analysis. The following cell types are provided for illustrative purposes, but other cell types can be routinely used, provided that the target is expressed in the cell type chosen. This can be readily determined by methods routine in the art, for example Northern blot analysis, ribonuclease protection assays, or RT-PCR.

[0247] T-24 Cells:

[0248] The human transitional cell bladder carcinoma cell line T-24 was obtained from the American Type Culture Collection (ATCC) (Manassas, Va.). T-24 cells were routinely cultured in complete McCoy's 5A basal media (Invitrogen Corporation, Carlsbad, Calif.) supplemented with 10% fetal calf serum (Invitrogen Corporation, Carlsbad, Calif.), penicillin 100 units per mL, and streptomycin 100 micrograms per mL (Invitrogen Corporation, Carlsbad, Calif.). Cells were routinely passaged by trypsinization and dilution when they reached 90% confluence. Cells were seeded into 96-well plates (Falcon-Primaria #3872) at a density of 7000 cells/well for use in RT-PCR analysis.

[0249] For Northern blotting or other analysis, cells may be seeded onto 100 mm or other standard tissue culture plates and treated similarly, using appropriate volumes of medium and oligonucleotide.

[0250] A549 Cells:

[0251] The human lung carcinoma cell line A549 was obtained from the American Type Culture Collection (ATCC) (Manassas, Va.). A549 cells were routinely cultured in DMEM basal media (Invitrogen Corporation, Carlsbad, Calif.) supplemented with 10% fetal calf serum (Invitrogen Corporation, Carlsbad, Calif.), penicillin 100 units per mL, and streptomycin 100 micrograms per mL (Invitrogen Corporation, Carlsbad, Calif.). Cells were routinely passaged by trypsinization and dilution when they reached 90% confluence.

[0252] NHDF Cells:

[0253] Human neonatal dermal fibroblast (NHDF) were obtained from the Clonetics Corporation (Walkersville, Md.). NHDFs were routinely maintained in Fibroblast Growth Medium (Clonetics Corporation, Walkersville, Md.) supplemented as recommended by the supplier. Cells were maintained for up to 10 passages as recommended by the supplier.

[0254] HEK Cells:

[0255] Human embryonic keratinocytes (HEK) were obtained from the Clonetics Corporation (Walkersville, Md). HEKs were routinely maintained in Keratinocyte Growth Medium (Clonetics Corporation, Walkersville, Md.) formulated as recommended by the supplier. Cells were routinely maintained for up to 10 passages as recommended by the supplier.

[0256] Treatment with Antisense Compounds:

[0257] When cells reached 70% confluency, they were treated with oligonucleotide. For cells grown in 96-well plates, wells were washed once with 100 μL OPTI-MEM™-1 reduced-serum medium (Invitrogen Corporation, Carlsbad, Calif.) and then treated with 130 μL of OPTI-MEM™-1 containing 3.75 μg/mL LIPOFECTIN™ (Invitrogen Corporation, Carlsbad, Calif.) and the desired concentration of oligonucleotide. After 4-7 hours of treatment, the medium was replaced with fresh medium. Cells were harvested 16-24 hours after oligonucleotide treatment.

[0258] The concentration of oligonucleotide used varies from cell line to cell line. To determine the optimal oligonucleotide concentration for a particular cell line, the cells are treated with a positive control oligonucleotide at a range of concentrations. For human cells the positive control oligonucleotide is selected from either ISIS 13920 (TCCGTCATCGCTCCTCAGGG, SEQ ID NO: 1) which is targeted to human H-ras, or ISIS 18078, (GTGCGCGCGAGCCCGAAATC, SEQ ID NO: 2) which is targeted to human Jun-N-terminal kinase-2 (JNK2). Both controls are 2′-O-methoxyethyl gapmers (2′-O-methoxyethyls shown in bold) with a phosphorothioate backbone. For mouse or rat cells the positive control oligonucleotide is ISIS 15770, ATGCATTCTGCCCCCAAGGA, SEQ ID NO: 3, a 2′-O-methoxyethyl gapmer (2′-O-methoxyethyls shown in bold) with a phosphorothioate backbone which is targeted to both mouse and rat c-raf. The concentration of positive control oligonucleotide that results in 80% inhibition of c-Ha-ras (for ISIS 13920) or c-raf (for ISIS 15770) mRNA is then utilized as the screening concentration for new oligonucleotides in subsequent experiments for that cell line. If 80% inhibition is not achieved, the lowest concentration of positive control oligonucleotide that results in 60% inhibition of H-ras or c-raf mRNA is then utilized as the oligonucleotide screening concentration in subsequent experiments for that cell line. If 60% inhibition is not achieved, that particular cell line is deemed as unsuitable for oligonucleotide transfection experiments. The concentrations of antisense oligonucleotides used herein are from 50 nM to 300 nM.

Example 10

[0259] Analysis of Oligonucleotide Inhibition of LIM Domain Kinase 1 Expression

[0260] Antisense modulation of LIM domain kinase 1 expression can be assayed in a variety of ways known in the art. For example, LIM domain kinase 1 mRNA levels can be quantitated by, e.g., Northern blot analysis, competitive polymerase chain reaction (PCR), or real-time PCR (RT-PCR). Real-time quantitative PCR is presently preferred. RNA analysis can be performed on total cellular RNA or poly(A)+ mRNA. The preferred method of RNA analysis of the present invention is the use of total cellular RNA as described in other examples herein. Methods of RNA isolation are taught in, for example, Ausubel, F. M. et al., Current Protocols in Molecular Biology, Volume 1, pp. 4.1.1-4.2.9 and 4.5.1-4.5.3, John Wiley & Sons, Inc., 1993. Northern blot analysis is routine in the art and is taught in, for example, Ausubel, F. M. et al., Current Protocols in Molecular Biology, Volume 1, pp. 4.2.1-4.2.9, John Wiley & Sons, Inc., 1996. Real-time quantitative (PCR) can be conveniently accomplished using the commercially available ABI PRISM™ 7700 Sequence Detection System, available from PE-Applied Biosystems, Foster City, Calif. and used according to manufacturer's instructions.

[0261] Protein levels of LIM domain kinase 1 can be quantitated in a variety of ways well known in the art, such as immunoprecipitation, Western blot analysis (immunoblotting), ELISA or fluorescence-activated cell sorting (FACS). Antibodies directed to LIM domain kinase 1 can be identified and obtained from a variety of sources, such as the MSRS catalog of antibodies (Aerie Corporation, Birmingham, Mich.), or can be prepared via conventional antibody generation methods. Methods for preparation of polyclonal antisera are taught in, for example, Ausubel, F. M. et al., (Current Protocols in Molecular Biology, Volume 2, pp. 11.12.1-11.12.9, John Wiley & Sons, Inc., 1997). Preparation of monoclonal antibodies is taught in, for example, Ausubel, F. M. et al., (Current Protocols in Molecular Biology, Volume 2, pp. 11.4.1-11.11.5, John Wiley & Sons, Inc., 1997).

[0262] Immunoprecipitation methods are standard in the art and can be found at, for example, Ausubel, F. M. et al., (Current Protocols in Molecular Biology, Volume 2, pp. 10.16.1-10.16.11, John Wiley & Sons, Inc., 1998). Western blot (immunoblot) analysis is standard in the art and can be found at, for example, Ausubel, F. M. et al., (Current Protocols in Molecular Biology, Volume 2, pp. 10.8.1-10.8.21, John Wiley & Sons, Inc., 1997). Enzyme-linked immunosorbent assays (ELISA) are standard in the art and can be found at, for example, Ausubel, F. M. et al., (Current Protocols in Molecular Biology, Volume 2, pp. 11.2.1-11.2.22, John Wiley & Sons, Inc., 1991).

Example 11

[0263] Poly(A)+ mRNA Isolation

[0264] Poly(A)+ mRNA was isolated according to Miura et al., (Clin. Chem., 1996, 42, 1758-1764). Other methods for poly(A)+ mRNA isolation are taught in, for example, Ausubel, F. M. et al., (Current Protocols in Molecular Biology, Volume 1, pp. 4.5.1-4.5.3, John Wiley & Sons, Inc., 1993). Briefly, for cells grown on 96-well plates, growth medium was removed from the cells and each well was washed with 200 μL cold PBS. 60 μL lysis buffer (10 mM Tris-HCl, pH 7.6, 1 mM EDTA, 0.5 M NaCl, 0.5% NP-40, 20 mM vanadyl-ribonucleoside complex) was added to each well, the plate was gently agitated and then incubated at room temperature for five minutes. 55 μL of lysate was transferred to Oligo d(T) coated 96-well plates (AGCT Inc., Irvine Calif.). Plates were incubated for 60 minutes at room temperature, washed 3 times with 200 μL of wash buffer (10 mM Tris-HCl pH 7.6, 1 mM EDTA, 0.3 M NaCl). After the final wash, the plate was blotted on paper towels to remove excess wash buffer and then air-dried for 5 minutes. 60 μL of elution buffer (5 mM Tris-HCl pH 7.6), preheated to 70° C., was added to each well, the plate was incubated on a 90° C. hot plate for 5 minutes, and the eluate was then transferred to a fresh 96-well plate.

[0265] Cells grown on 100 mm or other standard plates may be treated similarly, using appropriate volumes of all solutions.

Example 12

[0266] Total RNA Isolation

[0267] Total RNA was isolated using an RNEASY 96™ kit and buffers purchased from Qiagen Inc. (Valencia, Calif.) following the manufacturer's recommended procedures. Briefly, for cells grown on 96-well plates, growth medium was removed from the cells and each well was washed with 200 μL cold PBS. 150 μL Buffer RLT was added to each well and the plate vigorously agitated for 20 seconds. 150 μL of 70% ethanol was then added to each well and the contents mixed by pipetting three times up and down. The samples were then transferred to the RNEASY 96™ well plate attached to a QIAVAC™ manifold fitted with a waste collection tray and attached to a vacuum source. Vacuum was applied for 1 minute. 500 μL of Buffer RW1 was added to each well of the RNEASY 96™ plate and incubated for 15 minutes and the vacuum was again applied for 1 minute. An additional 500 μL of Buffer RW1 was added to each well of the RNEASY 96™ plate and the vacuum was applied for 2 minutes. 1 mL of Buffer RPE was then added to each well of the RNEASY 96™plate and the vacuum applied for a period of 90 seconds. The Buffer RPE wash was then repeated and the vacuum was applied for an additional 3 minutes. The plate was then removed from the QIAVAC™ manifold and blotted dry on paper towels. The plate was then re-attached to the QIAVAC™ manifold fitted with a collection tube rack containing 1.2 mL collection tubes. RNA was then eluted by pipetting 170 μL water into each well, incubating 1 minute, and then applying the vacuum for 3 minutes.

[0268] The repetitive pipetting and elution steps may be automated using a QIAGEN Bio-Robot 9604 (Qiagen, Inc., Valencia Calif.). Essentially, after lysing of the cells on the culture plate, the plate is transferred to the robot deck where the pipetting, DNase treatment and elution steps are carried out.

Example 13

[0269] Real-Time Quantitative PCR Analysis of LIM Domain Kinase 1 mRNA Levels

[0270] Quantitation of LIM domain kinase 1 mRNA levels was determined by real-time quantitative PCR using the ABI PRISM™ 7700 Sequence Detection System (PE-Applied Biosystems, Foster City, Calif.) according to manufacturer's instructions. This is a closed-tube, non-gel-based, fluorescence detection system which allows high-throughput quantitation of polymerase chain reaction (PCR) products in real-time. As opposed to standard PCR in which amplification products are quantitated after the PCR is completed, products in real-time quantitative PCR are quantitated as they accumulate. This is accomplished by including in the PCR reaction an oligonucleotide probe that anneals specifically between the forward and reverse PCR primers, and contains two fluorescent dyes. A reporter dye (e.g., FAM or JOE, obtained from either PE-Applied Biosystems, Foster City, Calif., Operon Technologies Inc., Alameda, Calif. or Integrated DNA Technologies Inc., Coralville, Iowa) is attached to the 5′end of the probe and a quencher dye (e.g., TAMRA, obtained from either PE-Applied Biosystems, Foster City, Calif., Operon Technologies Inc., Alameda, Calif. or Integrated DNA Technologies Inc., Coralville, Iowa) is attached to the 3′ end of the probe. When the probe and dyes are intact, reporter dye emission is quenched by the proximity of the 3′ quencher dye. During amplification, annealing of the probe to the target sequence creates a substrate that can be cleaved by the 5′-exonuclease activity of Taq polymerase. During the extension phase of the PCR amplification cycle, cleavage of the probe by Taq polymerase releases the reporter dye from the remainder of the probe (and hence from the quencher moiety) and a sequence-specific fluorescent signal is generated. With each cycle, additional reporter dye molecules are cleaved from their respective probes, and the fluorescence intensity is monitored at regular intervals by laser optics built into the ABI PRISM™ 7700 Sequence Detection System. In each assay, a series of parallel reactions containing serial dilutions of mRNA from untreated control samples generates a standard curve that is used to quantitate the percent inhibition after antisense oligonucleotide treatment of test samples.

[0271] Prior to quantitative PCR analysis, primer-probe sets specific to the target gene being measured are evaluated for their ability to be “multiplexed” with a GAPDH amplification reaction. In multiplexing, both the target gene and the internal standard gene GAPDH are amplified concurrently in a single sample. In this analysis, mRNA isolated from untreated cells is serially diluted. Each dilution is amplified in the presence of primer-probe sets specific for GAPDH only, target gene only (“single-plexing”), or both (multiplexing). Following PCR amplification, standard curves of GAPDH and target mRNA signal as a function of dilution are generated from both the single-plexed and multiplexed samples. If both the slope and correlation coefficient of the GAPDH and target signals generated from the multiplexed samples fall within 10% of their corresponding values generated from the single-plexed samples, the primer-probe set specific for that target is deemed multiplexable. Other methods of PCR are also known in the art.

[0272] PCR reagents were obtained from Invitrogen Corporation, (Carlsbad, Calif.). RT-PCR reactions were carried out by adding 20 μL PCR cocktail (2.5×PCR buffer (—MgCl2), 6.6 mM MgCl2, 375 μM each of dATP, dCTP, dCTP and dGTP, 375 nM each of forward primer and reverse primer, 125 nM of probe, 4 Units RNAse inhibitor, 1.25 Units PLATINUM® Taq, 5 Units MuLV reverse transcriptase, and 2.5×ROX dye) to 96-well plates containing 30 μL total RNA solution. The RT reaction was carried out by incubation for 30 minutes at 48° C. Following a 10 minute incubation at 95° C. to activate the PLATINUM® Taq, 40 cycles of a two-step PCR protocol were carried out: 95° C. for 15 seconds (denaturation) followed by 60° C. for 1.5 minutes (annealing/extension).

[0273] Gene target quantities obtained by real time RT-PCR are normalized using either the expression level of GAPDH, a gene whose expression is constant, or by quantifying total RNA using RiboGreenTM (Molecular Probes, Inc. Eugene, Oreg.). GAPDH expression is quantified by real time RT-PCR, by being run simultaneously with the target, multiplexing, or separately. Total RNA is quantified using RiboGreenTM RNA quantification reagent from Molecular Probes. Methods of RNA quantification by RiboGreenTM are taught in Jones, L. J., et al, (Analytical Biochemistry, 1998, 265, 368-374).

[0274] In this assay, 170 μL of RiboGreenTM working reagent (RiboGreenTM reagent diluted 1:350 in 10 mM Tris-HCl, 1 mM EDTA, pH 7.5) is pipetted into a 96-well plate containing 30 μL purified, cellular RNA. The plate is read in a CytoFluor 4000 (PE Applied Biosystems) with excitation at 480 nm and emission at 520 nm.

[0275] Probes and primers to human LIM domain kinase 1 were designed to hybridize to a human LIM domain kinase 1 sequence, using published sequence information (GenBank accession number D26309.1, incorporated herein as SEQ ID NO:4). For human LIM domain kinase 1 the PCR primers were: forward primer: TGGCACCGAGCACTCACA (SEQ ID NO: 5) reverse primer: CGACGTGGATGGAATTCTTCA (SEQ ID NO: 6) and the PCR probe was: FAM-CCGTCCGCGTCCAGGGAGTG-TAMRA (SEQ ID NO: 7) where FAM is the fluorescent dye and TAMRA is the quencher dye. For human GAPDH the PCR primers were: forward primer: GAAGGTGAAGGTCGGAGTC(SEQ ID NO:8) reverse primer: GAAGATGGTGATGGGATTTC (SEQ ID NO:9) and the PCR probe was: 5′ JOE-CAAGCTTCCCGTTCTCAGCC- TAMRA 3′ (SEQ ID NO: 10) where JOE is the fluorescent reporter dye and TAMRA is the quencher dye.

Example 14

[0276] Northern Blot Analysis of LIM Domain Kinase 1 mRNA Levels

[0277] Eighteen hours after antisense treatment, cell monolayers were washed twice with cold PBS and lysed in 1 mL RNAZOL™ (TEL-TEST “B” Inc., Friendswood, Tex.). Total RNA was prepared following manufacturer's recommended protocols. Twenty micrograms of total RNA was fractionated by electrophoresis through 1.2% agarose gels containing 1.1% formaldehyde using a MOPS buffer system (AMRESCO, Inc. Solon, Ohio.). RNA was transferred from the gel to HYBOND™-N+nylon membranes (Amersham Pharmacia Biotech, Piscataway, N.J.) by overnight capillary transfer using a Northern/Southern Transfer buffer system (TEL-TEST “B” Inc., Friendswood, Tex.). RNA transfer was confirmed by UV visualization. Membranes were fixed by UV cross-linking using a STRATALINKER™ UV Crosslinker 2400 (Stratagene, Inc, La Jolla, Calif.) and then probed using QUICKHYB™ hybridization solution (Stratagene, La Jolla, Calif.) using manufacturer's recommendations for stringent conditions.

[0278] To detect human LIM domain kinase 1, a human LIM domain kinase 1 specific probe was prepared by PCR using the forward primer TGGCACCGAGCACTCACA (SEQ ID NO: 5) and the reverse primer CGACGTGGATGGAATTCTTCA (SEQ ID NO: 6). To normalize for variations in loading and transfer efficiency membranes were stripped and probed for human glyceraldehyde-3-phosphate dehydrogenase (GAPDH) RNA (Clontech, Palo Alto, Calif.).

[0279] Hybridized membranes were visualized and quantitated using a PHOSPHORIMAGER™ and IMAGEQUANT™ Software V3.3 (Molecular Dynamics, Sunnyvale, Calif.). Data was normalized to GAPDH levels in untreated controls.

Example 15

[0280] Antisense Inhibition of Human LIM Domain Kinase 1 Expression by Chimeric Phosphorothioate Oligonucleotides Having 2′-MOE Wings and a Deoxy Gap

[0281] In accordance with the present invention, a series of oligonucleotides were designed to target different regions of the human 1 LIM domain kinase 1 RNA, using published sequences (GenBank accession number D26309.1, representing LIMK hypothetical-1, incorporated herein as SEQ ID NO: 4, GenBank accession number NM_(—)002314.2, representing the main mRNA of LIM domain kinase 1, incorporated herein as SEQ ID NO: 11, GenBank accession number NM_(—)016735.1, representing dLIMK, incorporated herein as SEQ ID NO: 12, GenBank accession number AW452413.1, the complement of which is incorporated herein as SEQ ID NO: 13, residues 95000-134000 of GenBank accession number NT_(—)025776.4, representing a genomic sequence of LIM domain kinase 1, incorporated herein as SEQ ID NO: 14, and GenBank accession number BF837468.1, the complement of which is incorporated herein as SEQ ID NO: 15). The oligonucleotides are shown in Table 1. “Target site” indicates the first (5′-most) nucleotide number on the particular target sequence to which the oligonucleotide binds. All compounds in Table 1 are chimeric oligonucleotides (“gapmers”) 20 nucleotides in length, composed of a central “gap” region consisting of ten 2′-deoxynucleotides, which is flanked on both sides (5′ and 3′ directions) by five-nucleotide “wings”. The wings are composed of 2′-methoxyethyl (2′-MOE)nucleotides. The internucleoside (backbone) linkages are phosphorothioate (P═S) throughout the oligonucleotide. All cytidine residues are 5-methylcytidines. The compounds were analyzed for their effect on human LIM domain kinase 1 mRNA levels by quantitative real-time PCR as described in other examples herein. Data are averages from two experiments in which A549 cells were treated with the oligonucleotides of the present invention. The positive control for each datapoint is identified in the table by sequence ID number. If present, “N.D.” indicates “no data”. TABLE 1 Inhibition of human LIM domain kinase 1 mRNA levels by chimeric phosphorothioate oligonucleotides having 2′-MOE wings and a deoxy gap TARGET CONTROL SEQ ID TARGET SEQ ID SEQ ID ISIS # REGION NO SITE SEQUENCE % INHIB NO NO 206161 Start 4 84 aacctcatgcactcgacggg 82 16 1 Codon 206162 3′UTR 4 2979 ctgctgcagccgtatgaaaa 74 17 1 206163 Start 11 157 gtcaacctcatgcactcgac 79 18 1 Codon 206164 Coding 11 175 caggtgcaacaaagtagcgt 75 19 1 206165 Coding 11 180 ccctccaggtgcaacaaagt 22 20 1 206166 Coding 11 185 ttcttccctccaggtgcaac 70 21 1 206167 Coding 11 190 atacgttcttccctccaggt 85 22 1 206168 Coding 11 196 tctcccatacgttcttccct 36 23 1 206169 Coding 11 201 cttcctctcccatacgttct 67 24 1 206170 Coding 11 213 gcaactcgcttccttcctct 43 25 1 206171 Coding 11 268 agggcctggaggtactggcc 83 26 1 206172 Coding 11 370 tccttcttgcagaagagctg 76 27 1 206173 Coding 11 375 agtagtccttcttgcagaag 71 28 1 206174 Coding 11 380 ggcccagtagtccttcttgc 87 29 1 206175 Coding 11 385 tagcgggcccagtagtcctt 83 30 1 206176 Coding 11 390 cgccatagcgggcccagtag 87 31 1 206177 Coding 11 541 agcttggagtgctccaccag 86 32 1 206178 Coding 11 546 agtacagcttggagtgctcc 69 33 1 206179 Coding 11 557 gcagtgcccgcagtacagct 75 34 1 206180 Coding 11 784 acatctgggctcatgcagcc 77 35 1 206181 Coding 11 789 tcttcacatctgggctcatg 54 36 1 206182 Coding 11 794 ggaattcttcacatctgggc 86 37 1 206183 Coding 11 799 tggatggaattcttcacatc 79 38 1 206184 Coding 11 823 atttccaagatccggtctcc 74 39 1 206185 Coding 11 830 gccattgatttccaagatcc 86 40 1 206186 Coding 11 835 ggcgtgccattgatttccaa 72 41 1 206187 Coding 11 903 tcagctggagcaggcggctg 60 42 1 206188 Coding 11 910 tcgagggtcagctggagcag 71 43 1 206189 Coding 11 1020 gtttctgccgggcagagctg 81 44 1 206190 Coding 11 1025 gacaggtttctgccgggcag 88 45 1 206191 Coding 11 1030 ctcaagacaggtttctgccg 62 46 1 206192 Coding 11 1035 agctcctcaagacaggtttc 54 47 1 206193 Coding 11 1099 aggtccttgcgctgggaggc 67 48 1 206194 Coding 11 1104 gacccaggtccttgcgctgg 41 49 1 206195 Coding 11 1225 tcacggtgtgtcaccttgat 73 50 1 206196 Coding 11 1306 accttcacctccttgaggaa 47 51 1 206197 Coding 11 1316 gcatcgcatgaccttcacct 71 52 1 206198 Coding 11 1321 tccaggcatcgcatgacctt 86 53 1 206199 Coding 11 1434 tgtccatgctcttgatgatg 68 54 1 206200 Coding 11 1439 ctggctgtccatgctcttga 76 55 1 206201 Coding 11 1498 aggtaggccatccctgatgc 74 56 1 206202 Coding 11 1526 gtctcggtggatgatgttca 69 57 1 206203 Coding 11 1540 ttgtgggagttgaggtctcg 55 58 1 206204 Coding 11 1566 cattcttgttctcgcggacc 81 59 1 206205 Coding 11 1937 aaaggatggcctcttctcgg 74 60 1 206206 Coding 11 2047 cgccggtaggtctcccagaa 28 61 1 206207 Stop 11 2101 gtggccctggctcagtcggg 80 62 1 Codon 206208 3′UTR 11 2164 ccacctcccgcggaggaatc 79 63 1 206209 3′UTR 11 2275 ctctgggccacggcagagag 80 64 1 206210 3′UTR 11 2460 cctccagggagctatggacc 83 65 1 206211 3′UTR 11 2535 tctgccctgatgtgcgccct 59 66 1 206212 3′UTR 11 2580 actccccatttggttcctgg 91 67 1 206213 3′UTR 11 2783 ccctgaggcttggtacctgc 14 68 1 206214 3′UTR 11 2830 ccggccagggcaccgcagct 87 69 1 206215 3′UTR 11 3056 gcagctgctgctgcagccgt 75 70 1 206216 3′UTR 11 3153 tcacagttttaatacttcct 0 71 1 206217 3′UTR 11 3158 aagcttcacagttttaatac 68 72 1 206218 3′UTR 11 3163 tgagaaagcttcacagtttt 43 73 1 206219 3′UTR 11 3168 tgcactgagaaagcttcaca 89 74 1 206220 3′UTR 11 3173 caaagtgcactgagaaagct 59 75 1 206221 3′UTR 11 3227 tcccacctccctaagtcatg 17 76 1 206222 3′UTR 11 3233 ggtgggtcccacctccctaa 63 77 1 206223 3′UTR 12 1160 ggtgtgtcaccttgatagcc 62 78 1 206224 3′UTR 12 2989 gctgctgcagccgtatgaaa 66 79 1 206225 Genomic 13 224 ggagtgaaagcacccagctg 74 80 1 206226 Genomic 13 252 taggtcccacagcccctgag 9 81 1 206227 Intron 14 5523 ctcagacccttgagttactg 68 82 1 206228 Intron 14 6884 actgcctctcgggttcaagc 69 83 1 206229 Intron: 14 12569 gtcacaacacctgcaggaga 66 84 1 Exon Junction 206230 Intron: 14 12730 ccaggtgagtgtgcaaggca 74 85 1 Exon Junction 206231 Intron: 14 14979 gcagtgcccgctgcaagagc 76 86 1 Exon Junction 206232 Intron: 14 22962 atgctgcagctcctcctggg 81 87 1 Exon Junction 206233 Intron: 14 23018 ggtccttgcgctgggaggcc 80 88 1 Exon Junction 206234 Intron: 14 31749 cattcttgttctgggaaggt 61 89 1 Exon Junction 206235 Intron 14 32241 tagctgggattacagtacta 53 90 1 206236 Intron 14 36586 ctggacctacctcgcacagg 43 91 1 206237 Intron 15 63 gtcctgccccacagactcag 62 92 1 206238 Intron 15 83 gcatgaaggcactacctccc 53 93 1

[0282] As shown in Table 1, SEQ ID NOs 16, 17, 18, 19, 21, 22, 24, 26, 27, 28, 29, 30, 31, 32, 33, 34, 35, 37, 38, 39, 40, 41, 43, 44, 45, 48, 50, 52, 53, 54, 55, 56, 57, 59, 60, 62, 63, 64, 65, 67, 69, 70, 72, 74, 79, 80, 82, 83, 84, 85, 86, 87 and 88 demonstrated at least 65% inhibition of human LIM domain kinase 1 expression in this assay and are therefore peferred. The target sites to which these preferred sequences are complementary are herein referred to as “preferred target regions” and are therefore preferred sites for targeting by compounds of the present invention. These preferred target regions are shown in Table 2. The sequences represent the reverse complement of the preferred antisense compounds shown in Table 1. “Target site” indicates the first (5′-most) nucleotide number of the corresponding target nucleic acid. Also shown in Table 2 is the species in which each of the preferred target regions was found. TABLE 2 Sequence and position of preferred target regions identified in LIM domain kinase 1. TARGET SEQ ID TARGET REV COMP SEQ ID SITE ID NO SITE SEQUENCE OF SEQ ID ACTIVE IN NO 123815 4 84 cccgtcgagtgcatgaggtt 16 H. sapiens 94 123816 4 2979 ttttcatacggctgcagcag 17 H. sapiens 95 123817 11 157 gtcgagtgcatgaggttgac 18 H. sapiens 96 123818 11 175 acgctactttgttgcacctg 19 H. sapiens 97 123820 11 185 gttgcacctggagggaagaa 21 H. sapiens 98 123821 11 190 acctggagggaagaacgtat 22 H. sapiens 99 123823 11 201 agaacgtatgggagaggaag 24 H. sapiens 100 123825 11 268 ggccagtacctccaggccct 26 H. sapiens 101 123826 11 370 cagctcttctgcaagaagga 27 H. sapiens 102 123827 11 375 cttctgcaagaaggactact 28 H. sapiens 103 123828 11 380 gcaagaaggactactgggcc 29 H. sapiens 104 123829 11 385 aaggactactgggcccgcta 30 H. sapiens 105 123830 11 390 ctactgggcccgctatggcg 31 H. sapiens 106 123831 11 541 ctggtggagcactccaagct 32 H. sapiens 107 123832 11 546 qgagcactccaagctgtact 33 H. sapiens 108 123833 11 557 agctgtactgcgggcactgc 34 H. sapiens 109 123834 11 784 ggctgcatgagcccagatgt 35 H. sapiens 110 123836 11 794 gcccagatgtgaagaattcc 37 H. sapiens 111 123837 11 799 gatgtgaagaattccatcca 38 H. sapiens 112 123838 11 823 ggagaccggatcttggaaat 39 H. sapiens 113 123839 11 830 ggatcttggaaatcaatggc 40 H. sapiens 114 123840 11 835 ttggaaatcaatggcacgcc 41 H. sapiens 115 123842 11 910 ctgctccagctgaccctcga 43 H. sapiens 116 123843 11 1020 cagctctgcccggcagaaac 44 H. sapiens 117 123844 11 1025 ctgcccggcagaaacctgtc 45 H. sapiens 118 123847 11 1099 gcctcccagcgcaaggacct 48 H. sapiens 119 123849 11 1225 atcaaggtgacacaccgtga 50 H. sapiens 120 123851 11 1316 aggtgaaggtcatgcgatgc 52 H. sapiens 121 123852 11 1321 aaggtcatgcgatgcctgga 53 H. sapiens 122 123853 11 1434 catcatcaagagcatggaca 54 H. sapiens 123 123854 11 1439 tcaagagcatggacagccag 55 H. sapiens 124 123855 11 1498 gcatcagggatggcctacct 56 H. sapiens 125 123856 11 1526 tgaacatcatccaccgagac 57 H. sapiens 126 123858 11 1566 ggtccgcgagaacaagaatg 59 H. sapiens 127 123859 11 1937 ccgagaagaggccatccttt 60 H. sapiens 128 123861 11 2101 cccgactgagccagqgccac 62 H. sapiens 129 123862 11 2164 gattcctccgcgggaggtgg 63 H. sapiens 130 123863 11 2275 ctctctgccgtggcccagag 64 H. sapiens 131 123864 11 2460 ggtccatagctccctggagg 65 H. sapiens 132 123866 11 2580 ccaggaaccaaatggggagt 67 H. sapiens 133 123868 11 2830 agctgcggtgccctggccgg 69 H. sapiens 134 123869 11 3056 acggctgcagcagcagctgc 70 H. sapiens 135 123871 11 3158 gtattaaaactgtgaagctt 72 H. sapiens 136 123873 11 3168 tgtgaagctttdtcagtgca 74 H. sapiens 137 123878 12 2989 tttcatacggctgcagcagc 79 H. sapiens 138 123879 13 224 cagctgggtgctttcactcc 80 H. sapiens 139 123881 14 5523 cagtaactcaagggtctgag 82 H. sapiens 140 123882 14 6884 gcttgaacccgagaggcagt 83 H. sapiens 141 123883 14 12569 tctcctgcaggtgttgtgac 84 H. sapiens 142 123884 14 12730 tgccttgcacactcacctgg 85 H. sapiens 143 123885 14 14979 gctcttgcagcgggcactgc 86 H. sapiens 144 123886 14 22962 cccaggaggagctgcagcat 87 H. sapiens 145 123887 14 23018 ggcctcccagcgcaaggacc 88 H. sapiens 146

[0283] As these “preferred target regions” have been found by expermentation to be open to, and accessible for, hybridzation with the antisense compounds of the present invention, one of skill in the art will recognize or be able to ascertain, using no more than routine experimentation, futher embodiments of the invention that encompass other compounds that specifically hybridize to these sites and consequently inhibit the expression of LIM domain kinase 1.

[0284] In one embodiment, the “preferred target region” may be employed in screening candidate antisense compounds. “Candidate antisense compounds” are those that inhibit the expression of a nucleic acid molecule encoding LIM domain kinase b 1and which comprise at least an 8-nucleobase portion which is complementary to a preferred target region. The method comprises the steps of contacting a preferred target region of a nucleic acid molecule encoding LIM domain kinase 1 with one or more candidate antisense compounds, and selecting for one or more candidate antisense compounds which inhibit the expression of a nucleic acid molecule encoding LIM domain kinase 1. Once it is shown that the candidate antisense compound or compounds are capable of inhibiting the expression of a nucleic acid molecule encoding LIM domain kinase 1, the candidate antisense compound may be employed as an antisense compound in accordance with the present invention.

[0285] According to the present invention, antisense compounds include ribozymes, external guide sequence (EGS) oligonucleotides (oligozymes), and other short catalytic RNAs or catalytic oligonucleotides which hybridize to the target nucleic acid and modulate its expression.

Example 16

[0286] Western Blot Analysis of LIM Domain Kinase 1 Protein Levels

[0287] Western blot analysis (immunoblot analysis) is carried out using standard methods. Cells are harvested 16-20 h after oligonucleotide treatment, washed once with PBS, suspended in Laemmli buffer (100 ul/well), boiled for 5 minutes and loaded on a 16% SDS-PAGE gel. Gels are run for 1.5 hours at 150 V, and transferred to membrane for western blotting. Appropriate primary antibody directed to LIM domain kinase 1 is used, with a radiolabeled or fluorescently labeled secondary antibody directed against the primary antibody species. Bands are visualized using a PHOSPHORIMAGER™ (Molecular Dynamics, Sunnyvale Calif.).

Example 17

[0288] Targeting of Individual Oligonucleotides to Specific Variants of LIM Domain Kinase 1.

[0289] It is advantageous to selectively inhibit the expression of one or more variants of LIM domain kinase 1. Consequently, in one embodiment of the present invention are oligonucleotides that selectively target, hybridize to, and specifically inhibit one or more, but fewer than all of the variants of LIM domain kinase 1. A summary of the target sites of the LIM domain kinase 1 main mRNA and variants is shown in Table 3 and includes GenBank accession number D26309.1, representing LIMK hypothetical-1, incorporated herein as SEQ ID NO: 4; and GenBank accession number NM_(—)002314.2, representing the main mRNA of LIM domain kinase (entered in Table 3 as LIMK), incorporated herein as SEQ ID NO: 11. LIMK hypothetical-1 and LIMK can be specifically inhibited using the oligonucleotides in Table 3, relative to GenBank accession number NM_(—)016735.1, representing dLIMK, incorporated herein as SEQ ID NO: 12, but not listed in Table 3. TABLE 3 Targeting of individual oligonucleotides to specific variants of LIM domain kinase 1 VARIANT OLIGO SEQ TARGET SEQ ISIS # ID NO. SITE VARIANT ID NO. 206191 46 960 LIMK hypothetical-1 4 206191 46 1030 LIMK 11 206192 47 965 LIMK hypothetical-1 4 206192 47 1035 LIMK 11 206193 48 1029 LIMK hypothetical-1 4 206193 48 1099 LIMK 11 206194 49 1034 LIMK hypothetical-1 4 206194 49 1104 LIMK 11 206233 88 1028 LIMK hypothetical-1 4 206233 88 1098 LIMK 11

Example 18

[0290] Antisense Inhibition of LIM Domain Kinase 1 Expression-Dose Response in A549 Cells

[0291] In accordance with the present invention, three of the oligonucleotides targeted to LIM domain kinase 1: ISIS 206212 (ACTCCCCATTTGGTTCCTGG, SEQ ID NO: 67), ISIS 206219 (TGCACTGAGAAAGCTTCACA, SEQ ID NO: 74) and ISIS 206214 (CCGGCCAGGGCACCGCAGCT, SEQ ID NO: 69) were investigated in a dose response experiment. The scrambled control oligonucleotides used in the dose response study were ISIS 129695 (TTCTACCTCGCGCGATTTAC, SEQ ID NO: 147), and ISIS 114845 (TACGTCCGGAGGCGTACGCC, SEQ ID NO: 148).

[0292] All compounds are chimeric oligonucleotides (“gapmers”) 20 nucleotides in length, composed of a central “gap” region consisting of ten 2′-deoxynucleotides, which is flanked on both sides (5′ and 3′ directions) by five-nucleotide “wings”. The wings are composed of 2′-methoxyethyl (2′-MOE) nucleotides. The internucleoside (backbone) linkages are phosphorothioate (P═S) throughout the oligonucleotides. All cytidine residues are 5-methylcytidines.

[0293] In the dose-response experiment, with mRNA levels as the endpoint, A549 cells were treated with ISIS 206212, ISIS 206219, ISIS 206214 and control oligonucleotides at doses of 6, 25, 100 and 400 nM oligonucleotide. Data were obtained by real-time quantitative PCR as described in other examples herein and are averaged from two experiments with mRNA levels in the treatment groups, (including the scrambled control), being normalized to an untreated control group. The data are shown in Table 4. TABLE 4 Inhibition of LIM domain kinase 1 mRNA levels by chimeric phosphorothioate oligonucleotides having 2′-MOE wings and a deoxy gap: Dose Response Percent Inhibition of LIM Domain Kinase 1 mRNA Levels Dose ISIS NO. 6 nM 25 nM 100 nM 400 nM 206212 0 73 91 97 206219 0 67 84 93 206214 0 46 84 98 129695 2 0 12 62 114845 0 0 0 24

[0294] From this data set, it is evident that ISIS 206212, ISIS 206219, and ISIS 206214 were capable of reducing LIM domain kinase 1 mRNA levels in a dose-dependent manner.

1 148 1 20 DNA Artificial Sequence Antisense Oligonucleotide 1 tccgtcatcg ctcctcaggg 20 2 20 DNA Artificial Sequence Antisense Oligonucleotide 2 gtgcgcgcga gcccgaaatc 20 3 20 DNA Artificial Sequence Antisense Oligonucleotide 3 atgcattctg cccccaagga 20 4 3262 DNA H. sapiens CDS (96)...(2039) 4 gcgccgagcc ggtttccccg ccggtgtccg agaggcgccc ccggcccggc cgcccccagc 60 cccagccccg ccgggccccg ccccccgtcg agtgc atg agg ttg acg cta ctt 113 Met Arg Leu Thr Leu Leu 1 5 tgt tgc acc tgg agg gaa gaa cgt atg gga gag gaa gga agc gag ttg 161 Cys Cys Thr Trp Arg Glu Glu Arg Met Gly Glu Glu Gly Ser Glu Leu 10 15 20 ccc gtg tgt gca agc tgc ggc cag agg atc tat gat ggc cag tac ctc 209 Pro Val Cys Ala Ser Cys Gly Gln Arg Ile Tyr Asp Gly Gln Tyr Leu 25 30 35 cag gcc ctg aac gcg gac tgg cac gca gac tgc ttc agg tgt tgt gac 257 Gln Ala Leu Asn Ala Asp Trp His Ala Asp Cys Phe Arg Cys Cys Asp 40 45 50 tgc agt gcc tcc ctg tcg cac cag tac tat gag aag gat ggg cag ctc 305 Cys Ser Ala Ser Leu Ser His Gln Tyr Tyr Glu Lys Asp Gly Gln Leu 55 60 65 70 ttc tgc aag aag gac tac tgg gcc cgc tat ggc gag tcc tgc cat ggg 353 Phe Cys Lys Lys Asp Tyr Trp Ala Arg Tyr Gly Glu Ser Cys His Gly 75 80 85 tgc tct gag caa atc acc aag gga ctg gtt atg gtg gct ggg gag ctg 401 Cys Ser Glu Gln Ile Thr Lys Gly Leu Val Met Val Ala Gly Glu Leu 90 95 100 aag tac cac ccc gag tgt ttc atc tgc ctc acg tgt ggg acc ttt atc 449 Lys Tyr His Pro Glu Cys Phe Ile Cys Leu Thr Cys Gly Thr Phe Ile 105 110 115 ggt gac ggg gac acc tac acg ctg gtg gag cac tcc aag ctg tac tgc 497 Gly Asp Gly Asp Thr Tyr Thr Leu Val Glu His Ser Lys Leu Tyr Cys 120 125 130 ggg cac tgc tac tac cag act gtg gtg acc ccc gtc atc gag cag atc 545 Gly His Cys Tyr Tyr Gln Thr Val Val Thr Pro Val Ile Glu Gln Ile 135 140 145 150 ctg cct gac tcc cct ggc tcc cac ctg ccc cac acc gtc acc ctg gtg 593 Leu Pro Asp Ser Pro Gly Ser His Leu Pro His Thr Val Thr Leu Val 155 160 165 tcc atc cca gcc tca tct cat ggc aag cgt gga ctt tca gtc tcc att 641 Ser Ile Pro Ala Ser Ser His Gly Lys Arg Gly Leu Ser Val Ser Ile 170 175 180 gac ccc ccg cac ggc cca ccg ggc tgt ggc acc gag cac tca cac acc 689 Asp Pro Pro His Gly Pro Pro Gly Cys Gly Thr Glu His Ser His Thr 185 190 195 gtc cgc gtc cag gga gtg gat ccg ggc tgc atg agc cca gat gtg aag 737 Val Arg Val Gln Gly Val Asp Pro Gly Cys Met Ser Pro Asp Val Lys 200 205 210 aat tcc atc cac gtc gga gac cgg atc ttg gaa atc aat ggc acg ccc 785 Asn Ser Ile His Val Gly Asp Arg Ile Leu Glu Ile Asn Gly Thr Pro 215 220 225 230 atc cga aat gtg ccc ctg gac gag att gac ctg ctg att cag gaa acc 833 Ile Arg Asn Val Pro Leu Asp Glu Ile Asp Leu Leu Ile Gln Glu Thr 235 240 245 agc cgc ctg ctc cag ctg acc ctc gag cat gac cct cac gat aca ctg 881 Ser Arg Leu Leu Gln Leu Thr Leu Glu His Asp Pro His Asp Thr Leu 250 255 260 ggc cac ggg ctg ggg cct gag acc agc ccc ctg agc tct ccg gct tat 929 Gly His Gly Leu Gly Pro Glu Thr Ser Pro Leu Ser Ser Pro Ala Tyr 265 270 275 act ccc agc ggg gag gcg ggc agc tct gcc cgg cag aaa cct gtc ttg 977 Thr Pro Ser Gly Glu Ala Gly Ser Ser Ala Arg Gln Lys Pro Val Leu 280 285 290 agg agc tgc agc atc gac agg tct ccg ggc gct ggc tca ctg ggc tcc 1025 Arg Ser Cys Ser Ile Asp Arg Ser Pro Gly Ala Gly Ser Leu Gly Ser 295 300 305 310 ccg gcc tcc cag cgc aag gac ctg ggt cgc tct gag tcc ctc cgc gta 1073 Pro Ala Ser Gln Arg Lys Asp Leu Gly Arg Ser Glu Ser Leu Arg Val 315 320 325 gtc tgc cgg cca cac cgc atc ttc cgg ccg tcg gac ctc atc cac ggg 1121 Val Cys Arg Pro His Arg Ile Phe Arg Pro Ser Asp Leu Ile His Gly 330 335 340 gag gtg ctg ggc aag ggc tgc ttc ggc cag gct atc aag gtg aca cac 1169 Glu Val Leu Gly Lys Gly Cys Phe Gly Gln Ala Ile Lys Val Thr His 345 350 355 cgt gag aca ggt gag gtg atg gtg atg aag gag ctg atc cgg ttc gac 1217 Arg Glu Thr Gly Glu Val Met Val Met Lys Glu Leu Ile Arg Phe Asp 360 365 370 gag gag acc cag agg acg ttc ctc aag gag gtg aag gtc atg cga tgc 1265 Glu Glu Thr Gln Arg Thr Phe Leu Lys Glu Val Lys Val Met Arg Cys 375 380 385 390 ctg gaa cac ccc aac gtg ctc aag ttc atc ggg gtg ctc tac aag gac 1313 Leu Glu His Pro Asn Val Leu Lys Phe Ile Gly Val Leu Tyr Lys Asp 395 400 405 aag agg ctc aac ttc atc act gag tac atc aag ggc ggc acg ctc cgg 1361 Lys Arg Leu Asn Phe Ile Thr Glu Tyr Ile Lys Gly Gly Thr Leu Arg 410 415 420 ggc atc atc aag agc atg gac agc cag tac cca tgg agc cag aga gtg 1409 Gly Ile Ile Lys Ser Met Asp Ser Gln Tyr Pro Trp Ser Gln Arg Val 425 430 435 agc ttt gcc aag gac atc gca tca ggg atg gcc tac ctc cac tcc atg 1457 Ser Phe Ala Lys Asp Ile Ala Ser Gly Met Ala Tyr Leu His Ser Met 440 445 450 aac atc atc cac cga gac ctc aac tcc cac aac tgc ctg gtc cgc gag 1505 Asn Ile Ile His Arg Asp Leu Asn Ser His Asn Cys Leu Val Arg Glu 455 460 465 470 aac aag aat gtg gtg gtg gct gac ttc ggg ctg gcg cgt ctc atg gtg 1553 Asn Lys Asn Val Val Val Ala Asp Phe Gly Leu Ala Arg Leu Met Val 475 480 485 gac gag aag act cag cct gag ggc ctg cgg agc ctc aag aag cca gac 1601 Asp Glu Lys Thr Gln Pro Glu Gly Leu Arg Ser Leu Lys Lys Pro Asp 490 495 500 cgc aag aag cgc tac acc gtg gtg ggc aac ccc tac tgg atg gca cct 1649 Arg Lys Lys Arg Tyr Thr Val Val Gly Asn Pro Tyr Trp Met Ala Pro 505 510 515 gag atg atc aac ggc cgc agc tat gat gag aag gtg gat gtg ttc tcc 1697 Glu Met Ile Asn Gly Arg Ser Tyr Asp Glu Lys Val Asp Val Phe Ser 520 525 530 ttt ggg atc gtc ctg tgc gag atc atc ggg cgg gtg aac gca gac cct 1745 Phe Gly Ile Val Leu Cys Glu Ile Ile Gly Arg Val Asn Ala Asp Pro 535 540 545 550 gac tac ctg ccc cgc acc atg gac ttt ggc ctc aac gtg cga gga ttc 1793 Asp Tyr Leu Pro Arg Thr Met Asp Phe Gly Leu Asn Val Arg Gly Phe 555 560 565 ctg gac cgc tac tgc ccc cca aac tgc ccc ccg agc ttc ttc ccc atc 1841 Leu Asp Arg Tyr Cys Pro Pro Asn Cys Pro Pro Ser Phe Phe Pro Ile 570 575 580 acc gtg cgc tgt tgc gat ctg gac ccc gag aag agg cca tcc ttt gtg 1889 Thr Val Arg Cys Cys Asp Leu Asp Pro Glu Lys Arg Pro Ser Phe Val 585 590 595 aag ctg gaa cac tgg ctg gag acc ctc cgc atg cac ctg gcc ggc cac 1937 Lys Leu Glu His Trp Leu Glu Thr Leu Arg Met His Leu Ala Gly His 600 605 610 ctg cca ctg ggc cca cag ctg gag cag ctg gac aga ggt ttc tgg gag 1985 Leu Pro Leu Gly Pro Gln Leu Glu Gln Leu Asp Arg Gly Phe Trp Glu 615 620 625 630 acc tac cgg cgc ggc gag agc gga ctg cct gcc cac cct gag gtc ccc 2033 Thr Tyr Arg Arg Gly Glu Ser Gly Leu Pro Ala His Pro Glu Val Pro 635 640 645 gac tga gccagggcca ctcagctgcc cctgtcccca cctctggaga atccaccccc 2089 Asp accagattcc tccgcgggag gtggccctca gctgggacag tggggaccca ggcttctcct 2149 cagagccagg ccctgacttg ccttctccca ccccgtggac cgcttcccct gccttctctc 2209 tgccgtggcc cagagccggc ccagctgcac acacacacca tgctctcgcc ctgctgtaac 2269 ctctgtcttg gcagggctgt cccctcttgc ttctccttgc atgagctgga gggcctgtgt 2329 gagttacgcc cctttccaca cgccgctgcc ccagcaaccc tgttcacgct ccacctgtct 2389 ggtccatagc tccctggagg ctgggccagg aggcagcctc cgaaccatgc cccatataac 2449 gcttgggtgc gtgggagggc gcacatcagg gcagaggcca agttccaggt gtctgtgttc 2509 ccaggaacca aatggggagt ctggggcccg ttttcccccc agggggtgtc taggtagcaa 2569 caggtatcga ggactctcca aacccccaaa gcagagagag ggctgatccc atggggcgga 2629 ggtccccagt ggctgagcaa acagcccctt ctctcgcttt gggtcttttt tttgtttctt 2689 tcttaaagcc actttagtga gaagcaggta ccaagcctca gggtgaaggg ggtcccttga 2749 gggagcgtgg agctgcggtg ccctggccgg cgatggggag gagccggctc cggcagtgag 2809 aggataggca cagtggaccg ggcaggtgtc caccagcagc tcagcccctg cagtcatctc 2869 agagcccctt cccgggcctc tcccccaagg ctccctgccc ctcctcatgc ccctctgtcc 2929 tctgcgtttt ttctgtgtaa tctatttttt aagaagagtt tgtattattt tttcatacgg 2989 ctgcagcagc agctgccagg ggcttgggat tttatttttg tggcgggcgg gggtgggagg 3049 gccattttgt cactttgcct cagttgagca tctaggaagt attaaaactg tgaagctttc 3109 tcagtgcact ttgaacctgg aaaacaatcc caacaggccc gtgggaccat gacttaggga 3169 ggtgggaccc acccaccccc atccaggaac cgtgacgtcc aaggaaccaa acccagacgc 3229 agaacaataa aataaattcc gtactcccca ccc 3262 5 18 DNA Artificial Sequence PCR Primer 5 tggcaccgag cactcaca 18 6 21 DNA Artificial Sequence PCR Primer 6 cgacgtggat ggaattcttc a 21 7 20 DNA Artificial Sequence PCR Probe 7 ccgtccgcgt ccagggagtg 20 8 19 DNA Artificial Sequence PCR Primer 8 gaaggtgaag gtcggagtc 19 9 20 DNA Artificial Sequence PCR Primer 9 gaagatggtg atgggatttc 20 10 20 DNA Artificial Sequence PCR Probe 10 caagcttccc gttctcagcc 20 11 3332 DNA H. sapiens CDS (166)...(2109) 11 gcgccgagcc ggtttccccg ccggtgtccg agaggcgccc ccggcccggc ccggcccggc 60 ccgcgccctc cgcccccgcc tccccgggcc ggcggcggtg ggcgagctcg cgggcccggc 120 cgcccccagc cccagccccg ccgggccccg ccccccgtcg agtgc atg agg ttg acg 177 Met Arg Leu Thr 1 cta ctt tgt tgc acc tgg agg gaa gaa cgt atg gga gag gaa gga agc 225 Leu Leu Cys Cys Thr Trp Arg Glu Glu Arg Met Gly Glu Glu Gly Ser 5 10 15 20 gag ttg ccc gtg tgt gca agc tgc ggc cag agg atc tat gat ggc cag 273 Glu Leu Pro Val Cys Ala Ser Cys Gly Gln Arg Ile Tyr Asp Gly Gln 25 30 35 tac ctc cag gcc ctg aac gcg gac tgg cac gca gac tgc ttc agg tgt 321 Tyr Leu Gln Ala Leu Asn Ala Asp Trp His Ala Asp Cys Phe Arg Cys 40 45 50 tgt gac tgc agt gcc tcc ctg tcg cac cag tac tat gag aag gat ggg 369 Cys Asp Cys Ser Ala Ser Leu Ser His Gln Tyr Tyr Glu Lys Asp Gly 55 60 65 cag ctc ttc tgc aag aag gac tac tgg gcc cgc tat ggc gag tcc tgc 417 Gln Leu Phe Cys Lys Lys Asp Tyr Trp Ala Arg Tyr Gly Glu Ser Cys 70 75 80 cat ggg tgc tct gag caa atc acc aag gga ctg gtt atg gtg gct ggg 465 His Gly Cys Ser Glu Gln Ile Thr Lys Gly Leu Val Met Val Ala Gly 85 90 95 100 gag ctg aag tac cac ccc gag tgt ttc atc tgc ctc acg tgt ggg acc 513 Glu Leu Lys Tyr His Pro Glu Cys Phe Ile Cys Leu Thr Cys Gly Thr 105 110 115 ttt atc ggt gac ggg gac acc tac acg ctg gtg gag cac tcc aag ctg 561 Phe Ile Gly Asp Gly Asp Thr Tyr Thr Leu Val Glu His Ser Lys Leu 120 125 130 tac tgc ggg cac tgc tac tac cag act gtg gtg acc ccc gtc atc gag 609 Tyr Cys Gly His Cys Tyr Tyr Gln Thr Val Val Thr Pro Val Ile Glu 135 140 145 cag atc ctg cct gac tcc cct ggc tcc cac ctg ccc cac acc gtc acc 657 Gln Ile Leu Pro Asp Ser Pro Gly Ser His Leu Pro His Thr Val Thr 150 155 160 ctg gtg tcc atc cca gcc tca tct cat ggc aag cgt gga ctt tca gtc 705 Leu Val Ser Ile Pro Ala Ser Ser His Gly Lys Arg Gly Leu Ser Val 165 170 175 180 tcc att gac ccc ccg cac ggc cca ccg ggc tgt ggc acc gag cac tca 753 Ser Ile Asp Pro Pro His Gly Pro Pro Gly Cys Gly Thr Glu His Ser 185 190 195 cac acc gtc cgc gtc cag gga gtg gat ccg ggc tgc atg agc cca gat 801 His Thr Val Arg Val Gln Gly Val Asp Pro Gly Cys Met Ser Pro Asp 200 205 210 gtg aag aat tcc atc cac gtc gga gac cgg atc ttg gaa atc aat ggc 849 Val Lys Asn Ser Ile His Val Gly Asp Arg Ile Leu Glu Ile Asn Gly 215 220 225 acg ccc atc cga aat gtg ccc ctg gac gag att gac ctg ctg att cag 897 Thr Pro Ile Arg Asn Val Pro Leu Asp Glu Ile Asp Leu Leu Ile Gln 230 235 240 gaa acc agc cgc ctg ctc cag ctg acc ctc gag cat gac cct cac gat 945 Glu Thr Ser Arg Leu Leu Gln Leu Thr Leu Glu His Asp Pro His Asp 245 250 255 260 aca ctg ggc cac ggg ctg ggg cct gag acc agc ccc ctg agc tct ccg 993 Thr Leu Gly His Gly Leu Gly Pro Glu Thr Ser Pro Leu Ser Ser Pro 265 270 275 gct tat act ccc agc ggg gag gcg ggc agc tct gcc cgg cag aaa cct 1041 Ala Tyr Thr Pro Ser Gly Glu Ala Gly Ser Ser Ala Arg Gln Lys Pro 280 285 290 gtc ttg agg agc tgc agc atc gac agg tct ccg ggc gct ggc tca ctg 1089 Val Leu Arg Ser Cys Ser Ile Asp Arg Ser Pro Gly Ala Gly Ser Leu 295 300 305 ggc tcc ccg gcc tcc cag cgc aag gac ctg ggt cgc tct gag tcc ctc 1137 Gly Ser Pro Ala Ser Gln Arg Lys Asp Leu Gly Arg Ser Glu Ser Leu 310 315 320 cgc gta gtc tgc cgg cca cac cgc atc ttc cgg ccg tcg gac ctc atc 1185 Arg Val Val Cys Arg Pro His Arg Ile Phe Arg Pro Ser Asp Leu Ile 325 330 335 340 cac ggg gag gtg ctg ggc aag ggc tgc ttc ggc cag gct atc aag gtg 1233 His Gly Glu Val Leu Gly Lys Gly Cys Phe Gly Gln Ala Ile Lys Val 345 350 355 aca cac cgt gag aca ggt gag gtg atg gtg atg aag gag ctg atc cgg 1281 Thr His Arg Glu Thr Gly Glu Val Met Val Met Lys Glu Leu Ile Arg 360 365 370 ttc gac gag gag acc cag agg acg ttc ctc aag gag gtg aag gtc atg 1329 Phe Asp Glu Glu Thr Gln Arg Thr Phe Leu Lys Glu Val Lys Val Met 375 380 385 cga tgc ctg gaa cac ccc aac gtg ctc aag ttc atc ggg gtg ctc tac 1377 Arg Cys Leu Glu His Pro Asn Val Leu Lys Phe Ile Gly Val Leu Tyr 390 395 400 aag gac aag agg ctc aac ttc atc act gag tac atc aag ggc ggc acg 1425 Lys Asp Lys Arg Leu Asn Phe Ile Thr Glu Tyr Ile Lys Gly Gly Thr 405 410 415 420 ctc cgg ggc atc atc aag agc atg gac agc cag tac cca tgg agc cag 1473 Leu Arg Gly Ile Ile Lys Ser Met Asp Ser Gln Tyr Pro Trp Ser Gln 425 430 435 aga gtg agc ttt gcc aag gac atc gca tca ggg atg gcc tac ctc cac 1521 Arg Val Ser Phe Ala Lys Asp Ile Ala Ser Gly Met Ala Tyr Leu His 440 445 450 tcc atg aac atc atc cac cga gac ctc aac tcc cac aac tgc ctg gtc 1569 Ser Met Asn Ile Ile His Arg Asp Leu Asn Ser His Asn Cys Leu Val 455 460 465 cgc gag aac aag aat gtg gtg gtg gct gac ttc ggg ctg gcg cgt ctc 1617 Arg Glu Asn Lys Asn Val Val Val Ala Asp Phe Gly Leu Ala Arg Leu 470 475 480 atg gtg gac gag aag act cag cct gag ggc ctg cgg agc ctc aag aag 1665 Met Val Asp Glu Lys Thr Gln Pro Glu Gly Leu Arg Ser Leu Lys Lys 485 490 495 500 cca gac cgc aag aag cgc tac acc gtg gtg ggc aac ccc tac tgg atg 1713 Pro Asp Arg Lys Lys Arg Tyr Thr Val Val Gly Asn Pro Tyr Trp Met 505 510 515 gca cct gag atg atc aac ggc cgc agc tat gat gag aag gtg gat gtg 1761 Ala Pro Glu Met Ile Asn Gly Arg Ser Tyr Asp Glu Lys Val Asp Val 520 525 530 ttc tcc ttt ggg atc gtc ctg tgc gag atc atc ggg cgg gtg aac gca 1809 Phe Ser Phe Gly Ile Val Leu Cys Glu Ile Ile Gly Arg Val Asn Ala 535 540 545 gac cct gac tac ctg ccc cgc acc atg gac ttt ggc ctc aac gtg cga 1857 Asp Pro Asp Tyr Leu Pro Arg Thr Met Asp Phe Gly Leu Asn Val Arg 550 555 560 gga ttc ctg gac cgc tac tgc ccc cca aac tgc ccc ccg agc ttc ttc 1905 Gly Phe Leu Asp Arg Tyr Cys Pro Pro Asn Cys Pro Pro Ser Phe Phe 565 570 575 580 ccc atc acc gtg cgc tgt tgc gat ctg gac ccc gag aag agg cca tcc 1953 Pro Ile Thr Val Arg Cys Cys Asp Leu Asp Pro Glu Lys Arg Pro Ser 585 590 595 ttt gtg aag ctg gaa cac tgg ctg gag acc ctc cgc atg cac ctg gcc 2001 Phe Val Lys Leu Glu His Trp Leu Glu Thr Leu Arg Met His Leu Ala 600 605 610 ggc cac ctg cca ctg ggc cca cag ctg gag cag ctg gac aga ggt ttc 2049 Gly His Leu Pro Leu Gly Pro Gln Leu Glu Gln Leu Asp Arg Gly Phe 615 620 625 tgg gag acc tac cgg cgc ggc gag agc gga ctg cct gcc cac cct gag 2097 Trp Glu Thr Tyr Arg Arg Gly Glu Ser Gly Leu Pro Ala His Pro Glu 630 635 640 gtc ccc gac tga gccagggcca ctcagctgcc cctgtcccca cctctggaga 2149 Val Pro Asp 645 atccaccccc accagattcc tccgcgggag gtggccctca gctgggacag tggggaccca 2209 ggcttctcct cagagccagg ccctgacttg ccttctccca ccccgtggac cgcttcccct 2269 gccttctctc tgccgtggcc cagagccggc ccagctgcac acacacacca tgctctcgcc 2329 ctgctgtaac ctctgtcttg gcagggctgt cccctcttgc ttctccttgc atgagctgga 2389 gggcctgtgt gagttacgcc cctttccaca cgccgctgcc ccagcaaccc tgttcacgct 2449 ccacctgtct ggtccatagc tccctggagg ctgggccagg aggcagcctc cgaaccatgc 2509 cccatataac gcttgggtgc gtgggagggc gcacatcagg gcagaggcca agttccaggt 2569 gtctgtgttc ccaggaacca aatggggagt ctggggcccg ttttcccccc agggggtgtc 2629 taggtagcaa caggtatcga ggactctcca aacccccaaa gcagagagag ggctgatccc 2689 atggggcgga ggtccccagt ggctgagcaa acagcccctt ctctcgcttt gggtcttttt 2749 tttgtttctt tcttaaagcc actttagtga gaagcaggta ccaagcctca gggtgaaggg 2809 ggtcccttga gggagcgtgg agctgcggtg ccctggccgg cgatggggag gagccggctc 2869 cggcagtgag aggataggca cagtggaccg ggcaggtgtc caccagcagc tcagcccctg 2929 cagtcatctc agagcccctt cccgggcctc tcccccaagg ctccctgccc ctcctcatgc 2989 ccctctgtcc tctgcgtttt ttctgtgtaa tctatttttt aagaagagtt tgtattattt 3049 tttcatacgg ctgcagcagc agctgccagg ggcttgggat tttatttttg tggcgggcgg 3109 gggtgggagg gccattttgt cactttgcct cagttgagca tctaggaagt attaaaactg 3169 tgaagctttc tcagtgcact ttgaacctgg aaaacaatcc caacaggccc gtgggaccat 3229 gacttaggga ggtgggaccc acccaccccc atccaggaac cgtgacgtcc aaggaaccaa 3289 acccagacgc agaacaataa aataaattcc gtactcccca ccc 3332 12 3271 DNA H. sapiens CDS (166)...(1083) 12 gcgccgagcc ggtttccccg ccggtgtccg agaggcgccc ccggcccggc ccggcccggc 60 ccgcgccctc cgcccccgcc tccccgggcc ggcggcggtg ggcgagctcg cgggcccggc 120 cgcccccagc cccagccccg ccgggccccg ccccccgtcg agtgc atg agg ttg acg 177 Met Arg Leu Thr 1 cta ctt tgt tgc acc tgg agg gaa gaa cgt atg gga gag gaa gga agc 225 Leu Leu Cys Cys Thr Trp Arg Glu Glu Arg Met Gly Glu Glu Gly Ser 5 10 15 20 gag ttg ccc gtg tgt gca agc tgc ggc cag agg atc tat gat ggc cag 273 Glu Leu Pro Val Cys Ala Ser Cys Gly Gln Arg Ile Tyr Asp Gly Gln 25 30 35 tac ctc cag gcc ctg aac gcg gac tgg cac gca gac tgc ttc agg tgt 321 Tyr Leu Gln Ala Leu Asn Ala Asp Trp His Ala Asp Cys Phe Arg Cys 40 45 50 tgt gac tgc agt gcc tcc ctg tcg cac cag tac tat gag aag gat ggg 369 Cys Asp Cys Ser Ala Ser Leu Ser His Gln Tyr Tyr Glu Lys Asp Gly 55 60 65 cag ctc ttc tgc aag aag gac tac tgg gcc cgc tat ggc gag tcc tgc 417 Gln Leu Phe Cys Lys Lys Asp Tyr Trp Ala Arg Tyr Gly Glu Ser Cys 70 75 80 cat ggg tgc tct gag caa atc acc aag gga ctg gtt atg gtg gct ggg 465 His Gly Cys Ser Glu Gln Ile Thr Lys Gly Leu Val Met Val Ala Gly 85 90 95 100 gag ctg aag tac cac ccc gag tgt ttc atc tgc ctc acg tgt ggg acc 513 Glu Leu Lys Tyr His Pro Glu Cys Phe Ile Cys Leu Thr Cys Gly Thr 105 110 115 ttt atc ggt gac ggg gac acc tac acg ctg gtg gag cac tcc aag ctg 561 Phe Ile Gly Asp Gly Asp Thr Tyr Thr Leu Val Glu His Ser Lys Leu 120 125 130 tac tgc ggg cac tgc tac tac cag act gtg gtg acc ccc gtc atc gag 609 Tyr Cys Gly His Cys Tyr Tyr Gln Thr Val Val Thr Pro Val Ile Glu 135 140 145 cag atc ctg cct gac tcc cct ggc tcc cac ctg ccc cac acc gtc acc 657 Gln Ile Leu Pro Asp Ser Pro Gly Ser His Leu Pro His Thr Val Thr 150 155 160 ctg gtg tcc atc cca gcc tca tct cat ggc aag cgt gga ctt tca gtc 705 Leu Val Ser Ile Pro Ala Ser Ser His Gly Lys Arg Gly Leu Ser Val 165 170 175 180 tcc att gac ccc ccg cac ggc cca ccg ggc tgt ggc acc gag cac tca 753 Ser Ile Asp Pro Pro His Gly Pro Pro Gly Cys Gly Thr Glu His Ser 185 190 195 cac acc gtc cgc gtc cag gga gtg gat ccg ggc tgc atg agc cca gat 801 His Thr Val Arg Val Gln Gly Val Asp Pro Gly Cys Met Ser Pro Asp 200 205 210 gtg aag aat tcc atc cac gtc gga gac cgg atc ttg gaa atc aat ggc 849 Val Lys Asn Ser Ile His Val Gly Asp Arg Ile Leu Glu Ile Asn Gly 215 220 225 acg ccc atc cga aat gtg ccc ctg gac gag att gac ctg ctg att cag 897 Thr Pro Ile Arg Asn Val Pro Leu Asp Glu Ile Asp Leu Leu Ile Gln 230 235 240 gaa acc agc cgc ctg ctc cag ctg acc ctc gag cat gac cct cac gat 945 Glu Thr Ser Arg Leu Leu Gln Leu Thr Leu Glu His Asp Pro His Asp 245 250 255 260 aca ctg ggc cac ggg ctg ggg cct gag acc agc ccc ctg agc tct ccg 993 Thr Leu Gly His Gly Leu Gly Pro Glu Thr Ser Pro Leu Ser Ser Pro 265 270 275 gct tat act ccc agc ggg gag gcg ggc agc tct gcc cgg cag aaa cct 1041 Ala Tyr Thr Pro Ser Gly Glu Ala Gly Ser Ser Ala Arg Gln Lys Pro 280 285 290 gtc ttc gca agg acc tgg gtc gct ctg agt ccc tcc gcg tag tctgccggcc 1093 Val Phe Ala Arg Thr Trp Val Ala Leu Ser Pro Ser Ala 295 300 305 acaccgcatc ttccggccgt cggacctcat ccacggggag gtgctgggca agggctgctt 1153 cggccaggct atcaaggtga cacaccgtga gacaggtgag gtgatggtga tgaaggagct 1213 gatccggttc gacgaggaga cccagaggac gttcctcaag gaggtgaagg tcatgcgatg 1273 cctggaacac cccaacgtgc tcaagttcat cggggtgctc tacaaggaca agaggctcaa 1333 cttcatcact gagtacatca agggcggcac gctccggggc atcatcaaga gcatggacag 1393 ccagtaccca tggagccaga gagtgagctt tgccaaggac atcgcatcag ggatggccta 1453 cctccactcc atgaacatca tccaccgaga cctcaactcc cacaactgcc tggtccgcga 1513 gaacaagaat gtggtggtgg ctgacttcgg gctggcgcgt ctcatggtgg acgagaagac 1573 tcagcctgag ggcctgcgga gcctcaagaa gccagaccgc aagaagcgct acaccgtggt 1633 gggcaacccc tactggatgg cacctgagat gatcaacggc cgcagctatg atgagaaggt 1693 ggatgtgttc tcctttggga tcgtcctgtg cgagatcatc gggcgggtga acgcagaccc 1753 tgactacctg ccccgcacca tggactttgg cctcaacgtg cgaggattcc tggaccgcta 1813 ctgcccccca aactgccccc cgagcttctt ccccatcacc gtgcgctgtt gcgatctgga 1873 ccccgagaag aggccatcct ttgtgaagct ggaacactgg ctggagaccc tccgcatgca 1933 cctggccggc cacctgccac tgggcccaca gctggagcag ctggacagag gtttctggga 1993 gacctaccgg cgcggcgaga gcggactgcc tgcccaccct gaggtccccg actgagccag 2053 ggccactcag ctgcccctgt ccccacctct ggagaatcca cccccaccag attcctccgc 2113 gggaggtggc cctcagctgg gacagtgggg acccaggctt ctcctcagag ccaggccctg 2173 acttgccttc tcccaccccg tggaccgctt cccctgcctt ctctctgccg tggcccagag 2233 ccggcccagc tgcacacaca caccatgctc tcgccctgct gtaacctctg tcttggcagg 2293 gctgtcccct cttgcttctc cttgcatgag ctggagggcc tgtgtgagtt acgccccttt 2353 ccacacgccg ctgccccagc aaccctgttc acgctccacc tgtctggtcc atagctccct 2413 ggaggctggg ccaggaggca gcctccgaac catgccccat ataacgcttg ggtgcgtggg 2473 agggcgcaca tcagggcaga ggccaagttc caggtgtctg tgttcccagg aaccaaatgg 2533 ggagtctggg gcccgttttc cccccagggg gtgtctaggt agcaacaggt atcgaggact 2593 ctccaaaccc ccaaagcaga gagagggctg atcccatggg gcggaggtcc ccagtggctg 2653 agcaaacagc cccttctctc gctttgggtc ttttttttgt ttctttctta aagccacttt 2713 agtgagaagc aggtaccaag cctcagggtg aagggggtcc cttgagggag cgtggagctg 2773 cggtgccctg gccggcgatg gggaggagcc ggctccggca gtgagaggat aggcacagtg 2833 gaccgggcag gtgtccacca gcagctcagc ccctgcagtc atctcagagc cccttcccgg 2893 gcctctcccc caaggctccc tgcccctcct catgcccctc tgtcctctgc gttttttctg 2953 tgtaatctat tttttaagaa gagtttgtat tattttttca tacggctgca gcagcagctg 3013 ccaggggctt gggattttat ttttgtggcg ggcgggggtg ggagggccat tttgtcactt 3073 tgcctcagtt gagcatctag gaagtattaa aactgtgaag ctttctcagt gcactttgaa 3133 cctggaaaac aatcccaaca ggcccgtggg accatgactt agggaggtgg gacccaccca 3193 cccccatcca ggaaccgtga cgtccaagga accaaaccca gacgcagaac aataaaataa 3253 attccgtact ccccaccc 3271 13 408 DNA H. sapiens 13 ggcacgccca tccgaaatgt gcccctggac gagattgacc tgctgattca ggaaaccacc 60 cgcctgctcc agctgaccct cgagcatgac cctcacgata cactggccca cgggctgggg 120 cctgagacca gccccatgag ctctccggct tatactccca gcggggaggc gggcagctct 180 gcccggcaga aacctgtctt gtaagtcagc ctgctcctct gttcagctgg gtgctttcac 240 tcctgctggg gctcaggggc tgtgggacct aggtcgggga gccagccctg cacaaatgca 300 gcccaggctt gagccaggga ggtggaggct gcagtaagct gtcatcacac cactgctctc 360 cagcttgggt gacaaaacaa gacccactct caaaaaaaaa aaaaaaaa 408 14 39001 DNA Homo sapiens 14 ccgagttccc ggctgcagct ggtttgggtc tgtgggggcc tcgcgcaggg aagcggaggc 60 cccttctcta ggctcctaca tcctcctggc ctgccccagc agagctagcg gccctccgag 120 acagctgggc ctcgcagctc ccagccgagg cctgtcgctg ggagaaagaa gaatgaatga 180 agggcggaag gggcggggag gggcggggcg cgccgcgcca gctcgcggcg cccccaggcc 240 gggccgcagc ggggcgtggc ctctagaggc gtggccttgg gcgtgggggc ggggttttct 300 ggtgggggcg gggccgcgca gggccgttcc cgccgccggc cgcccgcgcc ccgcccccgt 360 gccgcagctc acaggccccg cctcggtccg cccctccgct cgctccccaa gccgccgcgg 420 cgccgagccg gtttccccgc cggtgtccga gaggcgcccc cggcccggcc cggcccggcc 480 cgcgccctcc gcccccgcct ccccgggccg gcggcggtgg gcgagctcgc gggcccggcc 540 gcccccagcc ccagccccgc cgggccccgc cccccgtcga gtgcatgagg ttgacgctac 600 tttgttgcac ctggagggaa gaacgtatgg gagaggaagg tgcgcgggcc gcggggtgtg 660 gggcgagggc ctggaggggg tgcccggggc agcgtggggc acgggagggg gccgggtctg 720 ccaggaggcc gcgcccctgc ctcctccggg atgagctcgt ccttacgaag cccgcaggcc 780 cctccctgtc cccctcccgc ccgggatccc cctccccggc ccccggcgag ctgccctcct 840 gcgggtctgg gggcccctgg accctttttc ctcctcccac gtccccccgc gaaggactcc 900 cagacactgc ccaccccgcg tcggcctcca tccgcgtgct ctgtccacca cccgggcctc 960 gccctggggc caccctttat ccagtctcgg aagaaagagc ggctggggaa gccgcagccc 1020 cgggtcccag tggccgccgg gcgcctgccc ggctctgtga ccttgagcca ggcgctgact 1080 tcctggtcct cagtttcccc ttctgtacat ttggaaactg ggtagttgcc cccccggtgt 1140 cggtgattgg gggccagatg ggtagagcgg agataggcgt ccaggaagcc ggaggccgtg 1200 tactgcggga gcctcatcca ctctccctgt ccgtgcccca aacccggtgc ctgccctcag 1260 tcttggctgg gagcatgact catcctaacc tcctctttag ccccttctcc ctcactgggg 1320 cccaaggcgc agtactgcac tgcagttagg gttcaaggac tcccccagcc taggacaggg 1380 tctgggggcc cctccttgga tctccttcgc tgacctgtca cttagatcca cctggcccca 1440 aggcagggcc tgactccaca cctccccctg ccaccaactc ttcccaggcc catgaaaacc 1500 tgattggggt aggggcccac cttcctgtag cccctgccta cctaaggtac ctgcgtcttc 1560 acagagggtc aggctgttgt ggccttggga cctagctatg tgactgggca agccatgcca 1620 tctctggggc tcagtctccc cttctgtaca gtggagaggg gcaggtctgg ggcattttcc 1680 agggcccacc agctccaagg gtgccaggcc ccaaggatga ctaagcatgc ctgtggctgg 1740 ctagaggagg tgccaggcct ccctgggaca ggtgtctggg agtacccagg tctgcagccc 1800 cctccccttg ccaagccagg gcattcattg ccaaggatct gttagggccg gcacctccag 1860 gcttcctgcc cttgacctcc cagctggctt cagcccagga tgcactaatc cagccctgtc 1920 cagtccctgc ctttgaaggg ccctcttagt acttcttcct gggcaggaga gggaagaaag 1980 gaggctgtga taggaatgtc acccactgcc ttatccctaa agccactgct tcctttctcc 2040 tcatttacct tgccagatcc aatgctatag cgggaggatg gacctgatcc tcctcctaag 2100 ctgatacata gggaaacagg gccagagaag cttggcaacc tagtcagtat ctcagcaaga 2160 ctcaggccag cgccctttct tctcctattt ggcacagcga ctgccctgcc tgggcgctgc 2220 acatgtgcag tgtgcgagga ttggtgcagg tgtaggtata tgtggggtgg gcagggcaag 2280 ctgggcctgc accagatcac acttcctgag aatgcttccc aactcccttc ccaccctgca 2340 ggaagcgagt tgcccgtgtg tgcaagctgc ggccagagga tctatgatgg ccagtacctc 2400 caggccctga acgcggactg gcacgcagac tgcttcaggt agggtggggt gcccagggcc 2460 tgtgttgccc taaacaaggc ctgccagaga ggacaggctg gtcaaggaat gggggaggcc 2520 gggatatgcc tcctggtgcc gtcccctatt gtgacttcgt ggccttaatt taccatttat 2580 gacatgaggt gttttgacta gaaaatccct acaggccttc ctgttgtcat tttatttatc 2640 tatttttttt tctttttgag acggagtctc gctctgtcac ccaggctgga gtacagtggt 2700 gcgatcttgg ctcattgcaa cctcttcctc ctgggctcac gcagttctcc tgtgtcagcc 2760 tctggagtag ctgggattac aggcgtgcac caccacgccc agctaatttt tgtattttta 2820 gtagagacgg gttttgccat gttagccagg ctggtctgaa acttctgacc tcaagtgatc 2880 ttcccacctc agcctcccaa agtgctggga tgacagacat aaaccaccgc tcctggcctc 2940 attttatttt cttttatgta tttttctttt ttcgaaatgg tcttgctctg ttgcccaggc 3000 tggagtgcag tggtgccatc tcggctcatt gcaacctcca tctcccgggc taaagtgatc 3060 ctcctacttc agcctcccga gtagctggga ttataggtat acaccacaat gctcagctaa 3120 ttttttaaat tttgtgtaaa gacagggtct cactattgag acccaggctg gtcttgaact 3180 tgtgacctca agcaatcctc ctgccttggc ctccgaaagt gctaggctta caggcgtgag 3240 ctaacgcctt ggcctctgtt gtcatcctag atctctgaga tctaaatctt agagaggatg 3300 ggagagacct ccaattgagc cagtgcctgc aattcagccc cctgctggca cccagacagg 3360 gggaagagtt ggaaggaatg tccctcctgc cttctgggtg ttcatgctct tgcagggagg 3420 gaagacaaac caggccttaa gggaaaccag gccaccctca gtgtcttcca ggctgcttgc 3480 gaacatgcat aacccagtca caccagcccc agtgtccaga cacacaccca caggtaggaa 3540 gaaagtaggg tcagggttgt ggcggaggat aaagagtaca tgaggacctg aaggtcaccc 3600 agtaggacca tcctgagaag ccaggagcag gggtctacct gccttgagcc agagcagggc 3660 cagagcaggg gtctcaaagg atgtgagatt tcctgggtag aaaagtagag tggaggtggg 3720 gcgtggtggc tcacacctat aatcccatca ctttttgggg ctgaggtggg cagatcactt 3780 gagttcagga gttcgagaca agcctgggca atatggcaac accctgtctc cactgaaaat 3840 acaaaaaatt agccgggcgt ggtggcgcat gcctgtagtc ccagctactc aagaggctga 3900 ggtggcaggg ttacttgagc ctgggaggtg gaggctgcag tgagctatga tcgcaccact 3960 gcactctagc ctgggcaata gagcgagacc cagtctcaat ttttaaaaaa gaaagaaaga 4020 aaaacaaatg gtgtgggaga gaattacagg catagtcacc aaacagcaag gttcagggga 4080 gaaaactcca taaaagggta gaaggtgaag cttctgggat gcccagcagg ggtcaagaca 4140 tccaccacta ggactttatt ttaggcttct gccttggttt attttttggt ttttggtttt 4200 tttgagacag tcttgttgtg tcgcccaggc tggggagcag tggcgcgatc cctcctcact 4260 gcaacctccg cctcccaggt tcaagcgatt ctcctgcttc agcctcccaa gtagctggga 4320 ttacaggtgt gcaccaccac gcccggctaa gttttgtatt ttcagtagag atagggtttt 4380 gccatgttgg ccaggctggt ctcgaactcc tgacctcaag tgatctgccc gcctcagcct 4440 cccaaaatgc tggaattaca ggcatgagcc actgcacctg gcctcggttt gtttttttgt 4500 ttcttctttt cttttttttt acacagggtc ttgctgtgtc acccaggctg gcgtgcagtg 4560 gtgagatcat agcccactgt agcctccagc tccaactggt tcaagcgatc cttctgactc 4620 agcctcccaa agtgctggga ttacaagcat aagccaccat gcccagcctg ttttttcttt 4680 tttaggaata acgtctaacg ttttctaaca ttcagtaagg gacaacccct gttctaagta 4740 ctttgcatag ttagatatta gtgctgtctt tgttttgcca gagagaaaat tgggacacag 4800 agaggttaat tctcttgatg aaagtcacac agccagtgag tgaaatgaac acactcagtg 4860 tggctgaaag gagacagaca gcatgccctg ggattctgca tcaggtgctc agaaagaggc 4920 cttcgggggg caagagggct ctcaacaggc agaggaaacc atctgcacag cggtgggatg 4980 gtgcggactg ctgagggaac aggaacagtt cccttggaag gaacagaata agctgaggga 5040 tccaacaaga aacaaagttg agaccgattc gtgaagggcc ttgaatgcca agataaggag 5100 tttcagaagt caggatgggg gtggtggctc atccccgtaa tcccagcact ttgggaggcc 5160 gaggcaggca gatcacttga ccccaggagc ttgagaccag cctggccaac gtggtgaaac 5220 ccccgtctct actaaatatt caaaaattag ccaggcatgg tggcacatga ctgtaatcca 5280 agctactcgg gaggctaagg aaggagaatc acttgaacct gggaggcgga ggctgcagtg 5340 agctgagatc acgtcactgc actccagcct gggagacaga gcgagactcc atctcaaaaa 5400 aaaaaaaaaa aaaaaataga agggagtcgg cagaaagcca gggaggggct ggggtgacat 5460 gctgttgaag aatgccatcc cagtgggccg gtggtggtat ctaggcaggg aagggactgt 5520 cccagtaact caagggtctg agctcatagg acctgacctg ggacagtgac tgaggatgga 5580 gagaatttca ggcagaaggg acagtttttg gtgagtattt gtcatattgg ctaccatgca 5640 ttgagcactc ttcatgctaa tttgttaaat cttcatcata actctatgag ggactgtatg 5700 tgcccagttt gcatgggaga aacagagatt ccatgcaatc aagtgcctcg ctgaaggttg 5760 taacatctag agctgggact aaaaccttct cactccacat cgccacagag taggaaaggc 5820 aggggctggc ggtggcacat gcctataatc ccagcccttt gggaggcgga ggcaggtgga 5880 tctcttgagc ccaggagttt gagaccagtg tgggcaacat agtgaaacct tgtctctaca 5940 aaaaaattag ctgagcatgg tggtggtgcc tgtagtccca gctactcaag ggcgctgaca 6000 tgggagggtt gcttgagcct gggaggtgga ggttgcagtg agctatgatc acaccactgc 6060 aagccagcct gggtgacaga gtgaaatccc atctcaaaaa aaagaaagaa aggaagaaag 6120 aaaaagacag gggcttcggg gagggcatgg gcactggcga atggcagggt ggaacctgaa 6180 gccatctggt tttctaacct gggcactggg gagttggtgg tttgttgact ctgatggaat 6240 tgggggtcat gttggggagg agacatgctc atctgtgttg agctggaggg gacatgggct 6300 atccatggtg gctgtgtcct gcccagagct agccatggga gcctgagtcc agttggaggt 6360 aggaaagtca gaaaaaacgg ccgcctcgga gctggccctg agatggtgag tgggatttgt 6420 gatagggcca agacgaatga agggaagaac tttggggacc cctgtgtctg cggtgagggg 6480 ggagatggag ccttgggtga tggagagagg gtcaggagta gagccacaga agccacagga 6540 gggaagccgt gttacaggat gggtgtacct ggctttggag tggcctgtcc caaatcactc 6600 accaggagag gggtgagtcc ccaggtcagg gcagtaaaga ggaggcatgt ttgtgctgtc 6660 cctggtgtag tgaaactcaa gaaggaagcc aggtgcagtg gctcacgcct gtaatcccag 6720 cactttggga ggccaaggca ggcagatcac ctgaggtcgg gagtttgaga ccagtctggc 6780 caacatggtg aaaccccatc tctactaaaa atacaaaaat tagccgggcc tgttggtggg 6840 cgcctgtaat cccagctact caggaggctg aggcagaaga atcgcttgaa cccgagaggc 6900 agtgattgca gtgagtcaag atcgcgccac tgcactctag cctgggtgac agagcaagac 6960 tccatctcga aaaaaaaaag tctcaatatg gggaaagatc cactagaagt aagagccatg 7020 gcttctacct cgtggcttgt gggtgtgata ctcccaacag tccccaaagc tggtggtcct 7080 caccgcgtga cagtgagcag agcagctcag agggggtcac tgctcacctg ggtgcatggc 7140 tgaccacagc caggctggct ctcagtggga tgcccaaggt gctagactct gcttagtctc 7200 cctcgggccc tgggcttgag gcattgggcc cggcccagac ctcatttcat gcactgagac 7260 ctttgttcca gggcccctca cccctctgaa ggtgttcggg caggggcaat gtgataaggc 7320 catgaggggt ctgcagcctc cagccccact ggggaggtgg ccagtgattt ccactttcct 7380 ggcccctctg catgcccctc ccagtggaac ttcctagggt ccctgagtca gtcacttgca 7440 aataattatg gcgtgcccac tctgcattag gcccctctca caacaaccca gtaagggggt 7500 gctatttatt tattaaagcg attttttttt tgagtctcgc tctgtcgccc aggctggagt 7560 gcggtggcgc aatctcggct tactgcaagc tctgcctccc gggttcacac cattctcctg 7620 cctcagcctc ccaagtagct gggactacag gcgcccacca ccacacccgg ctaatttttt 7680 tttgtttgtt tgtattttta gtacagacga ggtttcgctg tgtgagccag gatggtctcg 7740 atctcctgac ctcgtgatcc gcccacctca gcctcccaaa gtgctgggat tacaggcgtg 7800 agccaccgtg cccggcaata ttaaagcgat tttaaggcca aggctggtaa ctcacgcctg 7860 taatcccagc actttgggag gctgaggcag gaggactgct tgaggccagg agtttgagat 7920 caacctaggc aacatagtga gactccatct ctacaaaaaa attagccagg cgtggtggtg 7980 cgtacctgta gtcccagcta ctcaggaggc tgagatggga ggatcatttg aacccaggat 8040 gtcgaagctg cagtgagctg tgatcacgcc actgcactct ggcctgggca acagagcgag 8100 acactgtctc aaatttttaa aaagcgattt tacaaatgag gtgcagagtt cagtcacttg 8160 ccaaaagtct cacagcgcgt gaggagtaga atcaggactc gaaccgaggc agcctggctt 8220 cagagcctac agtgtaacca cagcttagtc ccacacctcc cagaccaaca gggtccctgc 8280 cttctagtgg gcaagacact cagtgaacaa atgtagtgtc aggtattggg ggacagcact 8340 ctcaggaagt gatgtttaag ggacagaatt gaagggagca gtgtttagag gatgtcgggg 8400 gtagggccgg tgcatgtgca aaggccttgg ggtgggaatg tgcttggcac aactgaggac 8460 cacaaagcca gcgtgcggga gtgcagtcag tggccagggg tgcatagagc cttgtgggcc 8520 ccgtggaagg tgccgttggc tgtacagctt tttttttttt tttttttttt tttttttgag 8580 acagagtctc gcttttgttg cccaggctgg agtgcagtgg cgtgatctca gctcactgca 8640 acctccgcct cccgggttca agcgattctc ctgcctcagc ttcctggtag ctgggactac 8700 aggcgcccac caccacacct ggctaatttt tgtgttttta atagagacgg ggtttcacca 8760 tgttagccag gctggcctca aactcctgac ctcaagcgat ctgtctacct cagcctccca 8820 aagtgctggg gttacaggca tgagccactg cgcacaggca gctgtgcatc tttgaatgtc 8880 ataacctgag catctgagag ctgctcctgt cccctggccc ctgctcttga ggaagtccca 8940 cgctgatagg acagacaggg tcataagtgc tgtgatgggg gcctgcaggc tgctggaggg 9000 ctcagccggg accagatgct gcccctcttt gtagagtggg acaattgctg caggcccatg 9060 ggacctctgg tattagccct gagggttgtc actccggggc ctgccccttt ctgtgttctg 9120 acctcccagc cccttgcagg ccctgcctcc cggaaggtta tgaccaggct tggactggtc 9180 caggcttccc tttggctcac atactgcctc tgcgaggtcc cctccaggaa gcctcctgtg 9240 cacaaccccc agggctgccg catccctggt agcatctcct tggcagctgg gtgggctggc 9300 cctgggcaag gagggctgag catgctgctg gcctgtgggg ttggagcagc ggcgggatgc 9360 aacctccctt tcttcagggg acctttttgg cgaagacaaa ctgtccatag gaagtcgacc 9420 tctgttccct tgggggcagc agtggaagag gcagctgctt ttgagcttgt ccctgtcccc 9480 agagaagcct gaggccttca gtgccgttgc cagggccgag gctgaggagc ctacagcgtg 9540 tgttcaggac tgagggccag ggacgggcca caggctccct gcctggggtc caagcctaga 9600 tcgctcgctc cccacccgca ccaaagccca ggcaaagggt gcttcagcca cttcctgttg 9660 caggctcaga ccaagtcccc tggcacccac gcggctgcag cctcctcctg tgcgctgcag 9720 ccacgctggc cccaccctct gcagcctcca atcctgagcc cctgagggag gatggggaag 9780 cagctggtct ggccacccct gccctccctt agacctccag agcccccagt gtagccacag 9840 aggatgctgt tggcttcagc cccaagaaga cgccgcttcc tccagagggc taagtaagtg 9900 ggaatccccc tccctacttg tcctgggctc caggcagggc ccctggtgta aggcctgggg 9960 ctggaagccg acccacctag gtccaggctc tggggcagaa ctgaaactcc ttggttactg 10020 tcggctgcag cctgggagca ggccactgcc aaagctgtgg gtccttccag gacagtctcc 10080 ccatgaggcc ggaagctttg tggatgccca gcctggggcc gcggggagct ggcaggtcag 10140 tggcagacac tggtgggcag acctagtgtc tggtagaaca ggcatcaagg aagtggtgac 10200 cggagggaag ccaagtgcac tcaaaccctc gggtgagtca tcaccgccgg gtctttcaca 10260 gctgctgaaa gtgagcaaca gtgatgaagg tttgtgagtt tctgcgtgag cgagtgaatg 10320 gaccagtagc agtttccagg ttgtggaaga gcgttccctc cccgggatgg ggacacttgg 10380 ttacagcaat tcctaatccc ccacccaccc accgcccact gcagaggtat gcgggggccc 10440 tgcttcctgc aggcaggagt gaggggcact cctgtgatgt ggcacccctg tgaccgaggt 10500 catgtgtgat cggtgtaagg gcaggaagcg agtcattggt ctgcaccagg cgtgggggct 10560 tctgcgaggg caggacccaa agtcggcctg gcctcccggc tgcagcactc ctttcccttt 10620 cgaattaggt tagagccctg ggacgggagg tgccctgtag accacccccc tcaccaactt 10680 ccgtcctccg ccccaccccc gcggtgatcc ggtgaactgc cggccccctg ctgtgcaccg 10740 agtggggcag tgaccctgac gtggcgtctc ctgccgcccc tgccaccgcc accacctccg 10800 gtggcccagc ctccgcattc cccaccccca tggaggaatg caccaggcct cccttcctgg 10860 atgcacccct cacccacatg cttccaaacc ctggcatttt ctgctccccc tttactccca 10920 ccccttcccc taggctccca gacaaagggg aagtggctgg atcctcttaa agggacagtg 10980 tcccaccagc ttactgctga actcccctcc tcaaccccag ttccctagtt acagttaatt 11040 agcattagca gacagcccat gagtgatacc catgcaggcc ccaggctgtg gagagtttcc 11100 tgggtaggaa acagccctta aggtccctca tctcatccag gtcccagtct ttcctacctg 11160 cctctctcct agattgtggc cctttggagc ctggttcttc tgtccctgtg tgaccgacac 11220 atagcaccca aacagtggca gagcgggacg gaccccctag cctgttctct gtgtgggtct 11280 gtaccctgac ccagacatgc ccccccacag caggacccag gggggcacat gtgtgcctgc 11340 gggttcactg gggcacccgc atttggttta ttttattttt tagagagagg gtcttgctgt 11400 gtcacccagc tggagtgcag tggtgtaatc atagcacact gcagccttca actcctgggc 11460 tcaagcgatc ctccctcccc agcctcccta gtagctggga gtacaggacc cactgtatcc 11520 tggctaattt tttaataatt ttttaagaga tggggtctta ctgtgttgcc caggctggcc 11580 tcaaacctct ggcctcaagt gatcctccca ccttcgcctc ctgaagtgct gagattacag 11640 gcatgagcca ccatgcccat cccagactga catttctata tttgttcatc ctggctgggc 11700 agggctgctg gtccccaccc caccgggatg cttggctggg aaaaagccgg gaatgtaggt 11760 ctaaccctgg cctgtgttgt ggcacctaca gcctggcatt cctccccatc tgcccttcaa 11820 ggccccacca accaggcctc cttggtagcc tctagtgagg aaacaggcga accgtggctt 11880 tgatgaccct gcacacctgg ggattctcct ctatttttct ttttcttttt tttttttttg 11940 gagacagagt ctcactctgt cgccaggctg gagtgcagtg gcacaatttt ggctcactgc 12000 aacctctgcc tcccaggttc aagcgattct tctgcctcag cctcccgagt agctgggatt 12060 acaggtgccc accaccatgc ctggctagtt tttgtatttt tagtggagac tgggttttgc 12120 catgttggcc aggctggtct cagactcctg accccaagtg atctgcccac ctcggcctcc 12180 caaagtgctg ggattacagg tgtgagccac cgctttggga ggccgaggtg ggcggatcac 12240 gaggtcaaga gctcaagacc atcctggcca agatggtgaa accccatctc tactaaaaat 12300 acaaaaaatt agctgggcat ggtggtgtgt gcctgtagtc ccagctactc aggaggctga 12360 ggcaggagga tcacttgaac ctggaaggca gaggttgcag tgagccgaga tcgagccact 12420 gcactgcagc ctggcgacag agcaagactc cgtctcaaaa aacaaacaaa aagaaaactt 12480 gttctaattc ttacaaaggt gcctgtagcc gaggcagggg cccaggtgag gtggaggagg 12540 gcgggagtgg acgtctcagc ccggcccctc tcctgcaggt gttgtgactg cagtgcctcc 12600 ctgtcgcacc agtactatga gaaggatggg cagctcttct gcaagaagga ctactgggcc 12660 cgctatggcg agtcctgcca tgggtgctct gagcaaatca ccaagggact ggttatggtg 12720 agcgccccct gccttgcaca ctcacctggg gtgggggtat ccaagcagac cccatgctcc 12780 aggtctctct cccatcattg tctctcctgg tctccttttt gctggtcttt ggagctgctt 12840 tctgagcctg actgtctgtc tgtatccctc agcgccccca tctatggagc cagctctgtc 12900 caggagctca gcagctggcc agccgggtcc ctgcagttgt ttttttggtg acacccttgg 12960 aagaggccta ggggaggatc tgtgggggtt gttgggtctg ctgagctggg ctgttccctc 13020 ctcacccccg caccaggtgg ctggggagct gaagtaccac cccgagtgtt tcatctgcct 13080 cacgtgtggg acctttatcg gtgacgggga cacctacacg ctggtggagc actccaagct 13140 gtactggtga gtgccttggc ccctccctga gcctaggagg cccacctgtg tcacagatct 13200 gcaagggtgc tgactctccc acacccgggc ctcctgccct ttcccatggg gtgaggtttg 13260 ttggggcaaa tgttcatatc tcctttccca tcccggcatg gaaacaagtg agaaataaca 13320 cacagaagtc agtgtgaaaa agcctcagac ggccaggcat gctggctcac gcctgtaaac 13380 ccagcacttt gggattccga ggtgggtgga tcccttgagg ctaggagttc aagaccagcc 13440 tggccaacat ggtggaaccc catctctatt aaaaatacaa aaattaacca ggtgtggtgg 13500 cgggtgcctg taatcccagc tactcaggag gctgaggcag gagactctct tgaacctggg 13560 aggtggaagt tgcagtgagc caagattgca ccactgccct ccagcctagg caacagagca 13620 agactctgtc tcaaaacaga aaacctcaga cgtcagcttt cttactggcc atgactgcag 13680 catggtgctg gcacaaacca ccagaggtgg ggtggatgcc acaagttaag gacaccatcc 13740 ccagcataac tgctccctct ttagacacca gccacaagtt caggggtccc caacccactc 13800 acacttctga ccgactggct acaaattcag ggactcccaa gaccctgcca agtttgatcg 13860 tttgctaaca gactcacaga actcaggaaa tcctccattt ttatcccagt tttattatga 13920 aggacacagc tcaggtccga ccaaatgaag aagcatctcc cctccctccc ctagcacatc 13980 aatgtgatca ccaaccagga agcttcactg agcttcagca gccagagttt ttattgggat 14040 ttcattacat cgtcatgact gattgagtca ttggccgtat gatcaagctt agtctctagc 14100 ccccgttctt ggaggtcagg ctggatgaaa gctgcaaccc tcttcaaatc acatgatgta 14160 tctttgcggg gctgagtcat ctcattagta tcaactcagg aatagtctga ggggctcatg 14220 aataacaaag ataccccatt ccaaggactt agagtctccc tcccaggaat caggacaaaa 14280 cccagacaga ttctttctta tacaacactg atcaagctgg attagaggac aacgtggctt 14340 gatcccagat gggcttttaa tgacttcctc ctgaactgga tttatcctca ggccttgtcc 14400 tggccgcctt acaggatcac agcgagtaga cagacccgaa tgactcagag ggacgagggc 14460 tggctgggca cgcacagttc ctgctcccag ttccatagga agagtgaaag aaaagaaagc 14520 tggccaggtg cagtggctca cccctataat cccagcactt tgggaggcca aggcaggcag 14580 atcacctgag gtctggagtt tgaggccagc ctggccaaca tggtgaaacc gtctctacta 14640 aaaataagaa attagccagg catggtggtg cgtgcccgta atcccagcta ctcaggaggc 14700 tgaggcagga gaatcgcttg aacccaggag gcggaggtta cagtgagcca agatcacacc 14760 actgcacttt tggacaattg ctagctttcc ttttcttttg agacagagtc ttgctttgtc 14820 acccaggctg gggtgcagtg ttgtaatcaa cagagtgaga ctccatctca aaaaaaaaaa 14880 aaaaaaagga agggattggg ggaagagcct ggggctgggg gctgcagaga tgctgaaatt 14940 gatgacgccc ttgacactct tttcttccca ccccggcggc tcttgcagcg ggcactgcta 15000 ctaccagact gtggtgaccc ccgtcatcga gcagatcctg cctgactccc ctggctccca 15060 cctgccccac accgtcaccc tggtgtccat cccagcctca tctcatggca agcgtggact 15120 ttcagtctcc attgaccccc cgcacggccc accgggctgt ggcaccgagc actcacacac 15180 cgtccgcgtc caggggtgag tggccggcct gccgaggctg ccgtcggtgt ggctatggct 15240 gttgatgtgg gtggcagagt ctggcactgg gggccctgaa aatgaatggg cgagtgtttg 15300 ggtacagatg gggcccagtt ctgacaacct ggtttgccag atttctggcc cagtcattcc 15360 tctgaatacc attacaaatg ccagatacaa taaaaagaca ttttcaaccg ggcatggtgg 15420 cccacacctg taatctcagc acttcgggag gccgaagtgg gtggatcacc tgaggtcagg 15480 agttcgagac cagcctgggc aatgtggtga aaccccgtct ctactaaaaa tacaaacgta 15540 gccaggcatg gtagtgtgtg cctatagtgc cagctgcttg ggaggctgag gcaggagaat 15600 cacttgaacc caggaggtgg aggtttcagt gagccccgac tgccattgca ctccaggctg 15660 ggcaacaaga gtgtaactct gtatcaaaaa aataaaaata aaaaaaacac actcaaaaaa 15720 taaaaagaca ttttctttag tccatgtctg atccaacaag aaagaggagg aaccaagtca 15780 agaatgagtg aagaagctgg gcgcagtaac tcacacctgt aatctcagca ctttgggagg 15840 ccaaagtgag aggatcactt aaggccagaa gtttgagacc agcttgggca acatagcgag 15900 acctgcatgt ctacaaaaaa aaaaaaaaaa ttaaaaatta gccaggcatg gtgaaatcac 15960 tgaacacata aaggctgggc atggttgctc acacttataa tcgaaacact ttgggaggct 16020 gagatgggag gatcacttga ggccaggagt tcgaaaccag cctgggaaac attgtagtca 16080 cagctacttg ggaggctgag gcagaaggat ctcttgagcc caggaagtgg ctacagtgag 16140 ctataattgc acgactgcac tctaggctgg gcaatggagc aaaaccctgt ctcaaaaaaa 16200 tggggcaggg ctgataaaga ttagattact gtgtgacttt gagcagctgc tttctctcta 16260 ggctttgggg gtctgtttga acaatgaggg agttggatac cttggagctt tctaagattt 16320 ctgtggcgcc tttattgaca ccttgagaag tagcatgcag tgtttctact tttgggcaat 16380 tggtcacttc tttttttttg agacagtctc actctgtcgc ccagtctggg gtgcagtggt 16440 gtgataccag ctcactgcaa cctccaccca caaggttcaa gcaattcttg cacctcagcc 16500 ccctgagtag ctgggactac aggtgaccac atgtggctaa tttttgtatt tttagtaaag 16560 acagggtttc accatgttgg ccaggctcgt ttcaaactcc tgggctcaag tgatcctccc 16620 ttctcggcct cccaaagtgc cgggattaca ggtgtgagcc accgtgcccg gcccaagtgc 16680 tagctttctc tctctctttt tttttttttc gagacggagt ctcgctctgt cgcccaggct 16740 ggagtgcagt ggtgtggtct cggctcactg caagccccgc ctcctgggtt cacgccattc 16800 tcctgcctca gcctcccgag tagctgggac tacaggcacc tgccaccatg cccggctaat 16860 ttttttttta tatttagtag agacagggtt tcaccatatt aggcaggatg gtctcgatct 16920 cctgacctcg tgatccgccc gtctcggcct cccaaagtgc tgcgattaca ggcatgagcc 16980 accacgcccg gccctaccaa gtgctagctt tcatttgacg cagtgaatgt ttcttgtaca 17040 cctggcaggt gcctggcact gcataggcac tgttgagatg tgaaggtggc cctggggaca 17100 gaaaattata ctgggcttga ctgtgtgtct ccatcccttg acatcagcca agccagcagc 17160 tgctttacat acatgatgag cagacagctg cttgaaagag atgaggaaac tcccagacca 17220 acggctctta ccagagggcc aagggaggtc cccacagagt cagaggctgc agctggtccc 17280 tgaaatccag gcagaatttt agaaatgaag acagtcagct gggtgcagcg gctcatgcct 17340 gttatctcag ccacttcgga gggctgaggt gagaggattg cttgagccca ggaggtggag 17400 gctgcagcaa gctatgatga caccatgcat tccagcttgg gcgacagagc gagaccctat 17460 ctctaaaata aaaatgaaga agacagttaa tgacgtctcc tccctgtctg cctcactggg 17520 taagcattcg cccagccaac atctggaaca tcccagttct gcaaagagcc acacccttcc 17580 cagaaagagc ccaacttgcc aaagatttac ttatttgttt taaactggtt ttagttgacc 17640 gcttttcatt ttgtgtatag cagcgtttta aggaaggtct aatttatcca ggccacctgc 17700 tgctttagca aaccaaggga gaggatgtga gattctaagg aatttacata tgtatgtcat 17760 atatatatat atatatatat agacacacaa tttttttttg agacagggtc ttgctctgtc 17820 atacaggctg gagtgcagtg gcacaatcat agctcactat agcctcagat gcctgtgctc 17880 aagcaatcca ctcacctcgg cctcctgagt agtgagacta caggcacaca ccaccacacc 17940 cagctaattt tttaattttt tgtagagact gagtcttgct gtgtcgccca ggctagtctt 18000 gaactcctgg gctcaagcaa tcctcccaca ttggcttccc aaagtgctag gattacaagc 18060 gtgagccact atgcctggct tatttttaag gttatatgca tgcaaagcct gtatcaatga 18120 aaatattttc tttggttttt ttcaactttt catcttcgca ttttgcagat ttatagaaaa 18180 tttgctaaaa taataagtcc attgaataca tacacaccct tcaccaaggt tcaccaattc 18240 gtaactgcca tatttgggag ttatatgtgt gtctctctat atatacatat atggatacag 18300 atacatatac atgtttagtg acttgtttat atttgtacat acatgtacat gttgttattt 18360 attgatcgtt tgggagtaag ttgcagggat cattgactcc cccacaatta tgctagatat 18420 tctcaaaaga aggaccttct cttttttttt tttttttttg gagacagggt atcactgtca 18480 ttgaggctgg agtgcagtga tgcgatcaca gctcactgca gcctcaacct cccaggctca 18540 agtgatcctc ccacctctgc ctcccaagta gctgggacta caggcacggg ccaccacgcc 18600 tggctaggca ttctgttatg taattatcaa ttgtatctta tagttcagtg atcacatttt 18660 ggaaatgtaa cattgatacc attatctaat acacagacca tattcaaatt ttgcctattg 18720 tctctatact gaactactga gctgtccttt atagcaatct ccccctcatc cacagtccag 18780 tccatgatca acattgcatt taatcgtcat gtgtcatcag tatctttttt tttttttttt 18840 ttgagacgga attttgctct tgttgcccag gttggagcgc aatggcgcaa tcttggctta 18900 ttgcaacctc cgcctttggg cttaagtgat tctcctgcct cagcctccta agtagctgag 18960 attacaggcg tgcaccatta tgcatgccta atttttgtat ttttattaga gacggggttt 19020 taccatgttg ccctggctgg tcttgaactc ctgacctcaa atgatccacc cacctcagcc 19080 tcccaaaatg ctgggtttac aggcatgagc cactgcgtct ggccatttcc tcagcctttc 19140 attgcccttc atgatcttga catttttgaa gtgtacaggc cagtcattaa agtaaaatgt 19200 ttttcctttt tttttttttt ttttttaaaa agagacaggg tctcactgtg ttgcccaggc 19260 tggtctcaga ctcctaggct caagtgatcc tcccgcctca gcttcccaaa gtgctgggat 19320 tacaggcgtg agccatcgta cctgccctcg catttgggtt tgactgatgt ttcctcttag 19380 ggagacaggc tctgcaggtt tggcctgata ctgcataagt gatcctctgt ccttccgagt 19440 ggatcttgcc aggagacata tgatgtcagt gtgccctttg ctgaggatgt tcactttgat 19500 tacttgtttt ttctgtactg taaggatttt tttccctttg tcatcaataa accatttgtg 19560 agatttgagt ctgtaaatat cctgttccca aaaacccttc cccaaatgat ttgagcatct 19620 attgatgatt cttgcctgta gcgattatta ctagggtggc taccaaatgc tgaatttcta 19680 actctgttct tccttctgca tttgttactg taaggaagag cttctccccc atacgagaat 19740 agtctttttg tttgcttggt tgtttttttg agatagggtc tcactctgtt gcccaggctg 19800 gagtgcagtg acatgatcat agctcactgc agcctcgacc tcatgggctc aagcgatcct 19860 cctgcctcag cctctcgagt agctgggact acaggcagca ccaccatgcc tggctaattt 19920 tttatttttt gtaatggtga ggtctcacta ttttgctcag gctggtctcg aactcctgac 19980 ctcaagtgat cttcccacct cagcctccca aatagctggg attacaggag tgtgccacca 20040 tgctcagcta attttctgta aaaaatgtca tagagatggg gtcttgctat gctgcccagg 20100 ctggtctcaa acccctagtc tcaagcaatc ctcccacctt ggcctcccaa agtgctggga 20160 ttccaggcat gagccaccac acctggccct gtttttctta aagttctcag tctcctctct 20220 gccttacccc catccccttt tccatctcca ggacctaggg cagagacaaa gtgagcattc 20280 cctaaaaagc ttttatgagg caaaatgaaa accagctcac gcctataatc ccagcacttt 20340 gggaggccaa ggtgggtgga ttacctgagg tcaggagttc aagaccagcc tgaccaacat 20400 agagaaaccc catctgtact aaaaatacaa aattagccag gcatggtggc acatgcctgt 20460 aatcccagct actcaggagc ctgaggcaag agaatcactt gaacctggga ggcggaagtt 20520 gcaatgagcc gagatcactc cattgcactc cagcctgggc aacaagagca aaactctgtc 20580 tcaaaaaaaa aaaagaaaag aaaagaaaac caggtcccta acaccgaaga gttaaaagaa 20640 ataagtaaat ttggcaaatt ggtctttttg tgagttagct tataggcaac tgatcgaggg 20700 tctctttccc gtcttcaccc tgcaattgtg gctcagggca agctgccagc tccctcctgc 20760 caatgcagga gcaatagagc ttggcctcct cttgcagggc gagtttggga gtcagatatg 20820 aagccactaa tccgggacct ttttgggacc caaggcactc atctgcccca agcataccag 20880 gcaggccagg tgcaatgact catgtctgta atcctagcac tttgtttttg cgacggagtc 20940 tcgctctgtc cacccaggct ggagtgcagt ggcagaatct tgactcactg caacctccac 21000 ctcccaggtt caagcaattc ctgcctcagc ctcccaagta gctaggacta caggcgccca 21060 ctgccacgct cggctaattt ttgtattttc agtagagacg gcgtttcacc atgttggcca 21120 ggctggtctc aaactcctga cttcaagtaa tccatccacc ttggcctccc caactgttgg 21180 gattacaggt gtgagccact gcgcccggcc agtcctagcc ctttgggagg ctaaggcggg 21240 cggattgcat gagctcagga gttcgagacc agcctgggaa atgtggtgta accccgtctc 21300 tactaaaaat acaaaaaaaa ttagctgggt gtggtggtgt gcacctgtaa tcccagctac 21360 tcaggaggct gaggtacgag aatcgcttga actcaggagg cagaggctgc agtgagctga 21420 gattgtgcca ttgcactcca gcctgggtaa cagagtgaga ttctgtctcc aaaaaaaaaa 21480 aaaaaaaaaa ttcgagacca aacatacctg ggatttggaa ggatagatct gttcccccag 21540 ggtggagaca atggtccatt gaatgggaac agctgagcat cttgtgtggg tggccagtgc 21600 ctacaagcgt gccacctttc tccagctcac acctgtggca gacatcagta attgattaca 21660 gaattcctcc cctgaaacca gaactcggtg ttctggccat ctgctacttc ccagtcacac 21720 gaagtagaat cctccacctg ctcaccctgg atctggtgcc cttcgccttg gtttcctgtt 21780 ggggctctga gggacaggtg ggcactggcc tgacccctgc cttacccaca gagtggatcc 21840 gggctgcatg agcccagatg tgaagaattc catccacgtc ggagaccgga tcttggaaat 21900 caatggcacg cccatccgaa atgtgcccct ggacgaggta cggtcctgag tctgtggggc 21960 aggacgggag gtagtgcctt catgcctagc cccctcccca ctccaccccc attcacatgc 22020 ctgctgtccc cagattgacc tgctgattca ggaaaccagc cgcctgctcc agctgaccct 22080 cgagcatgac cctcacgata cactgggcca cgggctgggg cctgagacca gccccctgag 22140 ctctccggct tatactccca gcggggaggc gggcagctct gcccggcaga aacctgtctt 22200 gtaagtcagc ctgctcctcg gttcagctgg gtgctttcac tcctgctggg gctcaggggc 22260 tgtgggacct aggtcgggga gccagccctg cacaaatgca gcccaggctt gagccaggga 22320 ggtggaggct gcagtaagct gtcatcacac cactgctctc cagcttgggt gacaaaacaa 22380 gacccactct caaaaaaaaa gaggaaacac acatttttta aaaagccggg gacggggcca 22440 ggcgtggtgg ctcatgcctg taatcccagc actttgggag gccgaggcag gtggatcacc 22500 tgaggtcagg agttcaagac cagcctggcc aacatgggaa acctcatctt tactgaaaat 22560 acaaaaatta gccgggcttg gtggcaggtg cctgtagtcc cagctactca ggaggctgag 22620 gcagatgaat cacttgaacc caggagatgg aggttgcagt gagccaaggt cacgccacta 22680 tactccagcc tgggcaacag tgtgagactc tgtctcaaaa aaaaagagga tgacagagca 22740 ggatctgagg ggttgagggg agctgggggc tgccactaga gccaggatag gccgagacac 22800 tgggatgggc agcctttgga ctgtcccagg cgggccctcc caaagcaggg ggtgattgca 22860 tagactggca tggacagggg catgcaggca ggaggaggaa ggggcagggc cttggccggg 22920 tgctacctgt cccccggtgg cacttggcac catgtgtgcc ccccaggagg agctgcagca 22980 tcgacaggtc tccgggcgct ggctcactgg gctccccggc ctcccagcgc aaggacctgg 23040 gtcgctctga gtccctccgc gtagtctgcc ggccacaccg catcttccgg ccgtcggacc 23100 tcatccacgg ggaggtgctg ggcaagggct gcttcggcca ggctatcaag gtacagagca 23160 tgccagggtc tcaggggaca gtctgggtgg gacccctcca tcctccttcc ttcccagtct 23220 atggaaacac agtggaaggg gtatctggct tccagactcc ctggccagtg ccctctcctc 23280 ccttggcctc ctggagctaa ttaggaacag gggacctcct acaggtagac tgagacctta 23340 tgtgcgggag gtcattgaaa ggtggctcct agccaggcac agtagtttat ccctgtaatc 23400 ccagcaccat gagaggctaa ggctgtagga tcgcttgagc ccaggaattc aagaccagcc 23460 ttgacatcat ctctacaaaa aatttaaaaa ttaattgggt atagtggtgc atgcctgtgg 23520 tcccagctac ttgggaggct taggcaggag gattgtgagc caggagttca aggctgcagt 23580 gagctatgat catgccacag cactccagcc tgggcaatag agcaagaccc catctcaaaa 23640 aaaaaaaaaa aaagacaagg gattaataca tcccatccac ttgggtattt gggaacatcc 23700 catgcacagc ctagagtatg aagccatctg cacatctccc tggcagtcct ggggtggaga 23760 tggggcttcc tagaaggcgg gcttacagca gagcttctgt cttcacacct ctgtgtccca 23820 cacgcaggtg acacaccgtg agacaggtga ggtgatggtg atgaaggagc tgatccggtt 23880 cgacgaggag acccagagga cgttcctcaa ggaggtcagt gagcggaatg ccctcttccc 23940 tccagaggga cttccaggtg ctcacccctg ccccatcaac acaggtcgga aaagggctct 24000 gggaaccatt gaaagaagag cgagcaggcc aggcatagtg gctcacgcct gtaatcccaa 24060 cactttggga ggttaaggag agaggatact ttgagaccaa cctgggcaac atagcaagac 24120 cccgtctcta caaaaaaatt ttaaattaac cgagcttggc aatgtgcacc tgtcatccca 24180 gctactcggg gggctgaggt gggaggctcg cttgagccca ggagttggag gctgcaatga 24240 gccatgatcg caccactgca ctccagcctg gggaacaagg caagaccctg tgtccaaaaa 24300 aaataaaagt aactgcattg gtcgggcata gtggctcacg cctgtaatcc cagcactttg 24360 ggaggctgag ccgggcggat cacctgaggt caggagttcg agactaccct ggccaacatg 24420 gcaaaacccc gtctctacta aaaatacaaa aattagccca gcatgatggt ggtgagtgcc 24480 tgtcatccag gctactcagg aggctgaggc aggagaattt cttgaactca ggaggcggag 24540 gttgcagtga gccaagatcg tgccgctgcc ctccagcctg ggcgacagag tgagactcct 24600 tctcaaaaaa aaaaaaaaga aaagaaaaaa gaaagtaact gcaggcaggg gactgggaaa 24660 aagagcatcg ctgggggtgg gggcagctca agcagagggc acaggacgcc agagggtgtg 24720 gcagaggcag gagaggggag ctgggggttc cgtatctttg agaccgccta cagcccctgg 24780 tgggatggaa aagggagaag cagacccaag cacagctggg accacacaga gcccgggccc 24840 agcctgtttg tgccccgcca ggtgaaggtc atgcgatgcc tggaacaccc caacgtgctc 24900 aagttcatcg gggtgctcta caaggacaag aggctcaact tcatcactga gtacatcaag 24960 ggcggcacgc tccggggcat catcaagagc atggtgagtc ctgggcagag ccagccaccc 25020 ccgctgtgcg gccccgggca aagcagctcc ctctgtgagc ctcagtctca tctcttcaat 25080 ggggggaagc cacaggggtc tcaaaggccc tctgaaccct gattcctaat caaaaagggg 25140 agcgactgac tccatctaaa gctaggaaag gccaggtaca atggtgcaca cctgttattc 25200 tggcactttg ggagcccaag gcaagaggat cactcgaggc caggaattca aggctgcagt 25260 gagctgtgat ctcaccactg cactccagcc tggaccacac agcaagaccc tgtctcaaaa 25320 actaaaataa aattcagagc tttccttaag gatttgaata aaattacaaa tccatcttta 25380 gaaataaagt gctcaggcca ggtgcagtgg ctcatgccta taatctcagc actttcagag 25440 gctgaggcca gcagatcacc tgaggtcagg agtccaagac cagcctggcc aacatggtga 25500 aaccccgtct ctactaaaaa tacaaaaatt agctgggcct ggtggcaggc acctgtaatc 25560 ccagcacttt gggagactga ggttggcaga tcacctgagg tcaggagttc gagaccatcc 25620 tggtaacccg tctctactaa aaatacaaaa aattagccgg gcaaggtggc aggtgcctgt 25680 agtcccagct actcgggaga ctgaagcagg agaatggcgt tgaacccagg gggcagagcc 25740 tgcagtgagc caagatcgca ccactgcact ctagcctggg tgacagcgag attccttctc 25800 aaaaaaaaag cacttggagg aagcctcaca gagccctgtg ctggaccaca ccctggggat 25860 ccagtcctgg cctccagccc catttctgta ccaccctgag accatgggat cttcctcagg 25920 ttggattacc ttgtatccaa ggtgtggacc ctatgggctc ctgctaggtg taacttgaca 25980 caacgggttc cgttgtcagg tgcaatttag aaactctggg ctaggccaag cgcagtggct 26040 cacacctgaa ttcccaaact ttggaaggcc gaggcaggag ggtcactaga ggtcaggagg 26100 tcaagaccag cttggacaac ataatgagat cccaatccca tctctacaaa aaaaattaaa 26160 aaattagcca aatgtggtga cacatgcctg tggttccagc tccacaggag gctgaggcag 26220 aaggatcact tgagcacagg aggtcgaggc tgcactccag cctgggtgat agagtgagac 26280 cctgtctcaa taaaaaataa agatctccaa ggggatgagg tttgagaatg aggcgtctcc 26340 cccaaatgat ttgagcccaa agccccgttc tcctggcatg gctcagtgct gccactgcgc 26400 aggtgacctt gctgggccct tctacctctt acctgtctgt gaaagtaggt tctaattttt 26460 taaaaaccta gaaagatgag ttttttgttt ttgtttttgt ttttcccgag atggagtttt 26520 gctcttattg tccagcctga agtgcaatgg cgtgatctcg gctcactgca acctccacct 26580 cccaggttca atcgattctg cctcagcctc ccgagtagct gggattacag gagcccacca 26640 ccacacccgg ctaatttttg cgtttttagt agagacaggg tttcaccatg ttggtcaggc 26700 tggtctcaaa ctcctgacct cgtgatccaa ccactctgac ctcccaaagt gttgggatta 26760 caggcgtgag ccaccacacc tgacagaaag atgagatttt atagaaaata aatatagctt 26820 gttttctcag aggaggcaga ttgggagcta tagaggaata tccctgctta gagtttgaaa 26880 tcagttctgt taggaaataa tgtttgtagg ggccgggtgc ggtggctcac gcctgtaatg 26940 ccagcacttt gggaggctga ggcaggtgga tcacttgagg ttaggagttt gagaacagcc 27000 tggccaacat ggtgaaaccc tgtctctact aaaactacaa aaattagctg ggtttggtgg 27060 tggacacctg taatcccagc tacttgggag gctgaggcga gagaattgct tgaggccggg 27120 tgcagtggct catgcctgta atcccaacac tgggaggcca aggtgggcag atcacctgag 27180 gtaaggagtt caagaccagc ctgaccaaca tggtgaaacc ccgtctctac taaaaataca 27240 aaaaattagc tgggtgtggt ggcgcatgcc catagtccca gctactcagg aggctgagac 27300 acaagaatca cttgagcccc ggaggcgaag gttgtaggga gctgagatgg taccactgca 27360 ctccaccctg ggtgacagag tgagactcca tctaaagaaa aaaaaaaaag gaaataatgt 27420 ctgtgagctg tgttgactca tactccttag aagcagacag ttgtgggtgc ccgaagaaat 27480 cggggtgttg gggagcccag ggaccctcta ggacgcttgc ctcttcctgc ctctgtctca 27540 tgcaaccatc cctgccatcg gggcccccac cggccccacc ctggccattc tttctccatc 27600 ccaggacagc cagtacccat ggagccagag agtgagcttt gccaaggaca tcgcatcagg 27660 gatggtgagt gagccgggtg ctctagctcc attcataatc ccaccaggaa tttgcaaaca 27720 gaacccacaa agaagctttg aaagagggca gagggggtcg atgggagagt gggaagaatc 27780 gtcccgactg gcctgattgg ggtgggagca gagggagttc ctggggagcc aggatgggct 27840 ggggtccctc tgcacagctg ccccctgact cccgtgtccc cgtccctagg cctacctcca 27900 ctccatgaac atcatccacc gagacctcaa ctcccacaac tgcctggtcc gcgaggtgag 27960 taccagggcc ccacgtggct gggtgtcagg agacagcagg agcccatcca accccagcct 28020 cagggccttc ccagaactgg aggcccctcc atgttgcctc catgacttca atttgaggtg 28080 ggggtggggg gcagcagccc gtggggaaga gcgcagggtc aggaggcaga cagacctggg 28140 tttgagtcct gtctctgcca ctgactcatg gtggaccatc agagtcccag gctggtagga 28200 gggtctcata aatcaatgaa ggagaaagtg acatgtaagc tacaaaggac caggaccgtg 28260 gtcttcatag agcacagccc atggcagagt ggccatgggc tacaccagac agcaccagca 28320 tctgggggcc acagagtggg ggcataggcg tatgggctgg agtggtcagg gcaggcttcc 28380 tgaaagagga ggcttggcca gacacagtgg ctcacacctg taatcccagc actttgggag 28440 gccgaggcag gcggatcacg aggtcaggag atcgagaccg tcctggctaa catgggcact 28500 gtggctcaca cctacaatcc caacactttg ggaggccgag gtgggtggat cacttgaagc 28560 caggagttca agaccagcct ggccaacatg gctaacacgg tgaaacccca tctctactaa 28620 aaatataaaa aattagccgg gcgtggtggc aggtgcctgt agtcccaact acttgggagg 28680 ctgaagcagg agaatggtgt gaacccggga ggcggaactt gcagtgagcc aagatcgcgc 28740 caccgcactc cagcctgggt gacagagcga gactccatct caaaaaaaaa aagaggaggc 28800 tttaggtgga tatttaagca ggggacgggc aggcaaagag cccagtgtct aaggattgtc 28860 aagggaggag agcccggttc tccaccaaaa gcacaggagc gagtaaccat gcccatctgg 28920 agaggtggtg tattcgtgtc ctggggctgc catcatgaag tactgtgaac cagatggctc 28980 aaaacaacag aaatgtgctg ggcacagtgg ctcacaccta aaatcccagc aatttgggag 29040 gccaaggcag gtggattgct tgagctcagg agtttgagac cagcctgggc aacattacga 29100 aagcccatct ctgccaaaaa tacaaaacgg aatagccagc cgtggtggca taagcctatg 29160 gtcccaacta cctgggaggc tgaggtggga ggatcacttg agcctgggag gtagaggttg 29220 cagtgagcca agattgtgct actctactcc agcctgggag acagagccag accctgtctc 29280 aaaaaaacaa aacaaaacaa ggtcaggcac tgtggctcac gcctgtaatc ccagcacttt 29340 gggaggccga agtgggtgga tcacttgaag ccaggagttc aagaccagcc tggccaacat 29400 ggcaaaaccc tgtttctact aaaaattcaa aaattagcag gcatggtggc gcatgcctgt 29460 aatcccagct actcgggagg ctgaggcagg agaattgctt gaacccagga ggcagaggtt 29520 gtagtgagct gagattatgc cactgcactc cagcctgggt gatagagtca gacaccgtct 29580 caaaaaaaaa aaagcatcac atggcaagag gggctgacaa gagaccccca aactgaccat 29640 tatacagacc cactcttgtg ataactaacc tggtccctca ataacccatt aatctgttaa 29700 ttcatacaga gccctcatga cccaatcacc tcttacaggc cctgcctctt aataccgtta 29760 gagtcaggcc aggcatggtg acatgggcct gtagtcccag ctagttggaa ggctaggtgg 29820 gaggatccct tgagtccagg aggtaaatgt tacagtgagc tctgattgtg tcactgcact 29880 ccagcctggg caacagagcg agcccctgtt tttaaaacag caacaagcca ggcacagtgg 29940 ctcacgcctg taatcccaac actttgggag actgaggcag gcagatcact tgaggtcagg 30000 agttcaagac cagcctcacc aacacagtga gacccctctc tactaaaaat acaaaaatta 30060 gctgggcgtg gtggtgggtg cctgtagtct cagctactca tgagactgag gcagaattgc 30120 ttgaacccgg gaggtggagg ttgctgtgag ccgagatcac gtcactgcac tccagcaaca 30180 gagtgggact ccatctcaaa aaaaataaaa aataacagag atctgtgttg gcttacacct 30240 gtaatcccag cactttggga gtccaagatg ggcagattgc ttgagcccag gagtttgaga 30300 ccagccaggc aacatggcaa aaaaataaaa aaatttgtct ctacaaaaaa attaaaaaat 30360 tagctggcat ggtggtgagt atctatagta ccagctactc aggaggtgga ggtgggagga 30420 tcgcttgagc ctgggaagtt gaggctgcaa tgagctgtgt tcgtgccact gcactccagc 30480 ctgggccacg ggagggagac tctgcctcaa aaaaaaaaaa aaaaaatcaa acccgaaaag 30540 caaaaaacat agacctcacc tgcttattgg gaatattcaa gataaaatta ggccaggcac 30600 ggtggctcac gcctgtaatc ccagcacttt gggaggccga cgtgggcgga tcacgaggtc 30660 aggagatcga gaccatcctg gctaacacgg tgaaaccccg tctctactaa aaatacaaaa 30720 aattagctgg gcatggtggc aggcgcctgt agtcccagct acttgggagg ctgaggcagg 30780 agaatggcgt gaacctggga ggcagagctt gcagtgagct gagatcgtgc cactgcactt 30840 caacctgggc aatagagcaa gactccaact caaaaaaaaa aaaaaaaaga taaaattggg 30900 ccaggtatgg tggcttactc ctgtaatccc agcactttga aaggctgagg caggtggacc 30960 acttgaggcc agaagttgaa gaccagtctg ggcaacatag caagacccta tctcaatcag 31020 tcaatcaacc taaataaata gtaaatctgg tggcatgcca agcacaggac ctgggtctat 31080 aatcaaaatt cctgtcttga tgggcacagt ggctcacacc tgtaatccca gcactttggt 31140 aggccacagt gggtggatca cctgagatca ggagttcgaa acctgcctag ccaagtatgg 31200 tgaaacccgt ctttactaaa aatacaaaaa ttagccaggc atggtggcag gcgcctgtaa 31260 tcccagctac tcgggagggt gaggcaggag aatcgcttga acctgggagg cggaggttgc 31320 agtgagccga gatcgcgcca ctgcgctcca gcctgggtga cagagcaaga ctccgtctga 31380 aaaaaaaaac aaaagaattc ctgtcttctc tccgaaacaa agcagcatca gtgcccccgc 31440 aggtgggagg gagcgcttgc aggagggagc agtgggtccg ccacgacggt ctggggagca 31500 ggtggggagg gggcagaggg tgcagcgtgt ggtgggaggg aggaagccac actgctatct 31560 tcaggtgctt cccgcagctc catttgcaaa gagcggatgg gtttggggaa ggaaggggtc 31620 cccaccctgt gccaatacag cgtatcagag gtatgttctc tgggctgtct acaggttggc 31680 ttggggtcct ggggaggggc aggccaagcg ggcagtacta ggatcgggtc ccagcatgac 31740 ccggcttcac cttcccagaa caagaatgtg gtggtggctg acttcgggct ggcgcgtctc 31800 atggtggacg agaagactca gcctgagggc ctgcggagcc tcaagaagcc agaccgcaag 31860 aagcgctaca ccgtggtggg caacccctac tggatggcac ctgagatgat caacggtgag 31920 tggttcagcc ctgcccatca tggccctcac gggaagccat gggggagccc aggagagctg 31980 taacctccca agcccctggc ccctcccagc ctccttggct cttcagttac cctgtgggtc 32040 ctgttgctcc tataacacac ttagtggcag ccaggcacgg tggctcacgc ctgtaatccc 32100 agcactttgg gaggctgagg tgagtggatc acctgaggtc agtagttgga gaccagccta 32160 gccaacatgg tgaaaccccc attctttact aaaaatacaa aaattagctg ggcatggtgg 32220 tgggtgtaat cccagtactg tagtactgta atcccagcta ctagggaagc tgaggcagga 32280 gaatcgcttg aacctgggag gcagaggttg cagtgagccg agatcgcgcc attgcactcc 32340 agcctgggtg acgagcgaaa ctccatctca aaaaataaat aaatagaaga cacttagtgg 32400 cttaaataaa tgatcataca gttctggagt ctgaagtcca gcgtcagcct caccgggctg 32460 aaatcaagac gccggtaggg tgagctcctt ctgcaggctc cggggcacct gtttcctgac 32520 cttttctggc tcgtggaggc ttcctcattc ctcctgttgc tgccccctcc tctgtcttca 32580 gggctggctg caaagcatct tctcttctct gatctctgca tccatccccg catctctttc 32640 cctggctcta accttcctcc tttttttttt cttttttaaa gagggtctcg ctctgttact 32700 caggctggag tgcagtggtg ccaccatagc tcactgcagc ctcaaccttc tgggctcaaa 32760 ctgtcatccc accccagcct cctgaatagc tgggaccaca ggcatgcaac accacaccca 32820 gctaattttt ttatttttta ttttttattt ttttttgaga cagagtctcg ctgtgtctcc 32880 caggctagag tgcagtggcg tgatctcagc tcactgcaag ctccgcctcc tgggttcacg 32940 ccattctcct gcctcagcct cccgagtagc tgggactaca ggcgcccgcc aacacgcctg 33000 gctaattttt tgtattttta gtagaaacgg ggtttcaccg tgttagccaa gatggtgtcg 33060 atctcctgac ctcgtgatcc gcccgtctcg gcctcccaaa gtgctgggat tacaggcgtg 33120 agccaccgcg cctggccaat tttttaaatt tttaatagag acgggggtat cactatgttg 33180 cccaggctgg tctcaaactc ctggcttcag gcgatcctcc tgccttgacc tttcaaagtg 33240 ctgggattcc aggcatgagc caccatggcc ctccatcctt ctgataggga cccttacggt 33300 gacattgggc ccacctggat aatccaaaag cagccctcca tctcaagacc ctcaacttaa 33360 tcccatctgc agagtccgat ggaaggtggg acgtatacaa gtcccaggga tcaggacgca 33420 gtcatctttg gggatcatag ttctgcctcc cacagggtct gcttccctca gtccatttct 33480 ttgctgtcaa tggtcctata tatgcccaga ttataggtta taaagtcctt ctacaagcag 33540 gtgacacatg aacacaggtt cagggcaggc agaccccagc catcacctca tcatagttaa 33600 cctagttaaa ttagcctggc atgtggcgtg gtgcctaatg cctgtggtcc cagctactca 33660 ggaagccaaa gcgggagatt tacttgagcc aaggagatca aggctgcagt gagctatgat 33720 cataccactg ccttctagcc tgggcaacgg agtgagaccc tgtctcaaga aaacaaaaaa 33780 taggccaggc acagtggctc acacctgtaa ttccagcact ttgggaggct gaagcaggcg 33840 gattgcttga ggccaggagt tcgagaccag cctggccaac atggtgaaac gctgtctcta 33900 ctgaaaatac aaaaattacc cgggtgtggt ggcacagcta ctagggaggc tgaggcagga 33960 gaatcacttg aacccaggag cagaggttac attgggccaa gattgcacca ctgcactcca 34020 gcctgggcaa cagaggaaga ctgtgtctca aaaagaaaaa aaaaaaaacc ttcctgtaat 34080 cccagcactt tgggaggctg aggtgggcgg atcacgaggt caagagattg agaccatcct 34140 ggtcaacatg atgaaacccc atctctacta aaaatacaaa aaaattagct gggcgtggtt 34200 gcacgcgtct gtagtcccag ctacccggga ggctgaggca ggagaatgat gtgaacccag 34260 gaggcggagc ttgcagtgag ccgagatcgc accactgtac tccagcctga cgacagagtg 34320 ggactctgtg tcaaacacac acacacacac acacacacac acacacacac acacacacac 34380 acagagttaa catagcccgc aaagaagact ataaaacagt cttagtggcc gggcgcagtg 34440 gttcacgctt gtaatcccag cactttggga ggccgaggca ggtggatcat gaggtcagga 34500 gtttgagacc agcctggcca acacagtgaa accccatctc tactaaaaat acaaaaatta 34560 gctggacatg gtttcgggcg cccgtaatcc cagctactca ggaggctgag gcaggagagt 34620 tgcttgaacc caggaggcag aggcaggaga gttgcttgaa cccaggaggc agaggttgca 34680 gtgggcgaca gagcaagact ctgtctcaaa aaacaaaaaa gtcttagtgt ttcctatgtt 34740 tagggattag tgtgaggatt aaaggttgta aactcatttc cacctagttg gcattcagta 34800 aatgagaatt gacatttagt actaattgtt tcgggtattt tgttttttgt tttttgtttt 34860 ttgttttttc tgagaccgag tcttgctctg tcatccaggc tagaatgcat ggtgcgatct 34920 cggctcactg caagctccgc ctcccgggtt cacaccattc tcctgcctca gcctcccacg 34980 tagctgggac tacaggcgcc cgccaccacg cctggctaat tttttgtatt tttagtagag 35040 acggggtttc accatgatct cgatctcctg acctcgtgat ccacccgcct cagcctccca 35100 aagtgctggg attacaggtg tgagccaccg tgcccggcca gttttttgtt tttgagatgg 35160 agtcttgcat tgtcacccag gctggagtac agtggcgtga tctcggctca ctgcaacctc 35220 cacctcctgg gttcaagtga ttctcctgcc tcagtttccc tagtagctgg gattacaggc 35280 acctgccacc atgcctggct aatttttcta tttttagtag agatggggtt tcaccatgtt 35340 ggccaggctg atcttgaact cctgacctca ggtgatccac ccgcctcggc ctcccaaagt 35400 gctgggatta caggtgtgaa ccactgtgcc cggccatgta ccgattattt ttaacatcat 35460 taagtagctg gtatcattcc cattttacaa taaggaaact gaggctcaga gagtctgtgt 35520 cagtttcctg aggttgctgt aataaattgt tagaaacttg attatttaaa acagcagaaa 35580 atggtcaggc acagtggctc acacctgtaa tcccagcact ttgggaggcc gaggcgggca 35640 gatcactgga ggtcaggagt tcgagaccag cctggccaac atggtgaaac accatctcta 35700 ctaaaagtac aaaaattagc tgggcatggt ggcaggcgcc tgtaatccca gctactcggg 35760 aaattgaggc aggagaatcg cttgaaccca ggaggcagag gttgcagtga gccacaatcg 35820 taccactgca ctcttgcctg gacaacaaag caagactcca tctcaagata aaataaacag 35880 cagaaattta ttccctctta gttttggaag ccagaaggtt gaaatccaac agggctgcgc 35940 tccctccagg gcgatctagg ggagaatgca ttccttgcct cttccacctt ctggttgttt 36000 tgcattcctg ggcttgtggc cgcatcactc cagtctccac ccctgtcttc acagggccac 36060 ctcctcctct tctgctgtgt cttctctgtg tctctctcaa gagggcattt gcagtggcat 36120 ttggggccca cccagatcat ccagcatcat ctcatctcca gatccttaac ttaatcccat 36180 ctgcaaaaga ccctttttct gacccagtaa cattcacaga ttccagagac ctgacatggt 36240 tcccttttgg gaccagcaca gagttcatga cttgtgcaaa gtcacgcagc tgatcggtgc 36300 ctcgaactcc ttgtccaggg ctctgcccct tgctcctcag agctcccaaa ggcttgctca 36360 gacctggtgg ggttggggga aagagcctaa gcctgggttc ccatagaggt tgccggcatc 36420 tgcctcctgg gcctggacct cccggccggg gcatcctccc agctggcctg gtcccctgcc 36480 ttttggcatc cctggcaccc ccatgtgttc atctgctgac agtcggtctc tttatccagg 36540 ccgcagctat gatgagaagg tggatgtgtt ctcctttggg atcgtcctgt gcgaggtagg 36600 tccagggttg ggtagcagcg gtgttgaggc ctgggctcct ccccactcac ccaggctgca 36660 ggctcagcat ctgcaggggc ctcatgccag gaagcctgcc cacagcaagg catgggctgg 36720 cccccatggg gtactgcagt caggctgcag ccaggcccag tgccacctgc cctcaaacca 36780 cctggatggc acccagatgc ccaggctgag ggccccctgg agtaactgcc gggccttgta 36840 ctggacagat catcgggcgg gtgaacgcag accctgacta cctgccccgc accatggact 36900 ttggcctcaa cgtgcgagga ttcctggacc gctactgccc cccaaactgc cccccgagct 36960 tcttccccat caccgtgcgc tgttgcgatc tggaccccga gaagaggtga gtggggtggg 37020 gccctggcct gggagacggt ggggccgatt cccgggacag ccagacccac cgttccccac 37080 ccacctgtca cccaggccat cctttgtgaa gctggaacac tggctggaga ccctccgcat 37140 gcacctggcc ggccacctgc cactgggccc acagctggag cagctggaca gaggtttctg 37200 ggagacctac cggcgcggcg agagcggact gcctgcccac cctgaggtcc ccgactgagc 37260 cagggccact cagctgcccc tgtccccacc tctggagaat ccacccccac cagattcctc 37320 cgcgggaggt ggccctcagc tgggacagtg gggacccagg cttctcctca gagccaggcc 37380 ctgacttgcc ttctcccacc ccgtggaccg cttcccctgc cttctctctg ccgtggccca 37440 gagccggccc agctgcacac acacaccatg ctctcgccct gctgtaacct ctgtcttggc 37500 agggctgtcc cctcttgctt ctccttgcat gagctggagg gcctgtgtga gttacgcccc 37560 tttccacacg ccgctgcccc agcaaccctg ttcacgctcc acctgtctgg tccatagctc 37620 cctggaggct gggccaggag gcagcctccg aaccatgccc catataacgc ttgggtgcgt 37680 gggagggcgc acatcagggc agaggccaag ttccaggtgt ctgtgttccc aggaaccaaa 37740 tggggagtct ggggcccgtt ttccccccag ggggtgtcta ggtagcaaca ggtatcgagg 37800 actctccaaa cccccaaagc agagagaggg ctgatcccat ggggcggagg tccccagtgg 37860 ctgagcaaac agccccttct ctcgctttgg gtcttttttt tgtttctttc ttaaagccac 37920 tttagtgaga agcaggtacc aagcctcagg gtgaaggggg tcccttgagg gagcgtggag 37980 ctgcggtgcc ctggccggcg atggggagga gccggctccg gcagtgagag gataggcaca 38040 gtggaccggg caggtgtcca ccagcagctc agcccctgca gtcatctcag agccccttcc 38100 cgggcctctc ccccaaggct ccctgcccct cctcatgccc ctctgtcctc tgcgtttttt 38160 ctgtgtaatc tattttttaa gaagagtttg tattattttt tcatacggct gcagcagcag 38220 ctgccagggg cttgggattt tatttttgtg gcgggcgggg gtgggagggc cattttgtca 38280 ctttgcctca gttgagcatc taggaagtat taaaactgtg aagctttctc agtgcacttt 38340 gaacctggaa aacaatccca acaggcccgt gggaccatga cttagggagg tgggacccac 38400 ccacccccat ccaggaaccg tgacgtccaa ggaaccaaac ccagacgcag aacaataaaa 38460 taaattccgt actccccacc caggtcctgc gtggcgatgt gtgtctgggg ccctggggaa 38520 atagtcaagg taagaggagt tagtcttccc tgaccagaag acaaggatga gtgtggtggc 38580 tcatgcctgt gatcccagca ctctgggagg ctgagacagg acgatccctt aagcccagga 38640 gttcaagacc agtctggaca acatagtgag atcctgtctc tacaaaaatt tttttttaat 38700 tagttgggca gaggccaggt gtggtggctc atgcctgtaa tcccagcact ttgggaggca 38760 gaggcgggtg gatcacctga agttaggagt tcaagaccag tctggccaac atggtgaaaa 38820 ctcgtctcta ctaaaaatac aaaaattagc cgggcgtggt ggcacatgcc tgtagtccta 38880 gctacttggg agactgaggc aggagaatcg cttgaacccg aaaggcagag gttgcagtga 38940 gccgaggtgg tgccattcca ctccagcctg ggaaagagcg agactttgtc tccaaaaaaa 39000 a 39001 15 342 DNA H. sapiens 15 ctccataatg tgtgcccaaa tggcacgccc atccgaaatg tgcccctgga cgaggtacgg 60 tcctgagtct gtggggcagg acgggaggta gtgccttcat gcctagcccc ctccccactc 120 cacccccatt cacatgcctg ctgtccccag attgacctgc tgattcagga aaccagccgc 180 ctgctccagc tgaccctcga gcatgaccct cacgatacac tgggccacgg gctggggcct 240 gagaccagcc accctgcagc tctccggctt atactcccag cggggaggcg ggcagctctg 300 cccggcagaa acctgtcttg taagtcagcg tgctcctcgt tc 342 16 20 DNA Artificial Sequence Antisense Oligonucleotide 16 aacctcatgc actcgacggg 20 17 20 DNA Artificial Sequence Antisense Oligonucleotide 17 ctgctgcagc cgtatgaaaa 20 18 20 DNA Artificial Sequence Antisense Oligonucleotide 18 gtcaacctca tgcactcgac 20 19 20 DNA Artificial Sequence Antisense Oligonucleotide 19 caggtgcaac aaagtagcgt 20 20 20 DNA Artificial Sequence Antisense Oligonucleotide 20 ccctccaggt gcaacaaagt 20 21 20 DNA Artificial Sequence Antisense Oligonucleotide 21 ttcttccctc caggtgcaac 20 22 20 DNA Artificial Sequence Antisense Oligonucleotide 22 atacgttctt ccctccaggt 20 23 20 DNA Artificial Sequence Antisense Oligonucleotide 23 tctcccatac gttcttccct 20 24 20 DNA Artificial Sequence Antisense Oligonucleotide 24 cttcctctcc catacgttct 20 25 20 DNA Artificial Sequence Antisense Oligonucleotide 25 gcaactcgct tccttcctct 20 26 20 DNA Artificial Sequence Antisense Oligonucleotide 26 agggcctgga ggtactggcc 20 27 20 DNA Artificial Sequence Antisense Oligonucleotide 27 tccttcttgc agaagagctg 20 28 20 DNA Artificial Sequence Antisense Oligonucleotide 28 agtagtcctt cttgcagaag 20 29 20 DNA Artificial Sequence Antisense Oligonucleotide 29 ggcccagtag tccttcttgc 20 30 20 DNA Artificial Sequence Antisense Oligonucleotide 30 tagcgggccc agtagtcctt 20 31 20 DNA Artificial Sequence Antisense Oligonucleotide 31 cgccatagcg ggcccagtag 20 32 20 DNA Artificial Sequence Antisense Oligonucleotide 32 agcttggagt gctccaccag 20 33 20 DNA Artificial Sequence Antisense Oligonucleotide 33 agtacagctt ggagtgctcc 20 34 20 DNA Artificial Sequence Antisense Oligonucleotide 34 gcagtgcccg cagtacagct 20 35 20 DNA Artificial Sequence Antisense Oligonucleotide 35 acatctgggc tcatgcagcc 20 36 20 DNA Artificial Sequence Antisense Oligonucleotide 36 tcttcacatc tgggctcatg 20 37 20 DNA Artificial Sequence Antisense Oligonucleotide 37 ggaattcttc acatctgggc 20 38 20 DNA Artificial Sequence Antisense Oligonucleotide 38 tggatggaat tcttcacatc 20 39 20 DNA Artificial Sequence Antisense Oligonucleotide 39 atttccaaga tccggtctcc 20 40 20 DNA Artificial Sequence Antisense Oligonucleotide 40 gccattgatt tccaagatcc 20 41 20 DNA Artificial Sequence Antisense Oligonucleotide 41 ggcgtgccat tgatttccaa 20 42 20 DNA Artificial Sequence Antisense Oligonucleotide 42 tcagctggag caggcggctg 20 43 20 DNA Artificial Sequence Antisense Oligonucleotide 43 tcgagggtca gctggagcag 20 44 20 DNA Artificial Sequence Antisense Oligonucleotide 44 gtttctgccg ggcagagctg 20 45 20 DNA Artificial Sequence Antisense Oligonucleotide 45 gacaggtttc tgccgggcag 20 46 20 DNA Artificial Sequence Antisense Oligonucleotide 46 ctcaagacag gtttctgccg 20 47 20 DNA Artificial Sequence Antisense Oligonucleotide 47 agctcctcaa gacaggtttc 20 48 20 DNA Artificial Sequence Antisense Oligonucleotide 48 aggtccttgc gctgggaggc 20 49 20 DNA Artificial Sequence Antisense Oligonucleotide 49 gacccaggtc cttgcgctgg 20 50 20 DNA Artificial Sequence Antisense Oligonucleotide 50 tcacggtgtg tcaccttgat 20 51 20 DNA Artificial Sequence Antisense Oligonucleotide 51 accttcacct ccttgaggaa 20 52 20 DNA Artificial Sequence Antisense Oligonucleotide 52 gcatcgcatg accttcacct 20 53 20 DNA Artificial Sequence Antisense Oligonucleotide 53 tccaggcatc gcatgacctt 20 54 20 DNA Artificial Sequence Antisense Oligonucleotide 54 tgtccatgct cttgatgatg 20 55 20 DNA Artificial Sequence Antisense Oligonucleotide 55 ctggctgtcc atgctcttga 20 56 20 DNA Artificial Sequence Antisense Oligonucleotide 56 aggtaggcca tccctgatgc 20 57 20 DNA Artificial Sequence Antisense Oligonucleotide 57 gtctcggtgg atgatgttca 20 58 20 DNA Artificial Sequence Antisense Oligonucleotide 58 ttgtgggagt tgaggtctcg 20 59 20 DNA Artificial Sequence Antisense Oligonucleotide 59 cattcttgtt ctcgcggacc 20 60 20 DNA Artificial Sequence Antisense Oligonucleotide 60 aaaggatggc ctcttctcgg 20 61 20 DNA Artificial Sequence Antisense Oligonucleotide 61 cgccggtagg tctcccagaa 20 62 20 DNA Artificial Sequence Antisense Oligonucleotide 62 gtggccctgg ctcagtcggg 20 63 20 DNA Artificial Sequence Antisense Oligonucleotide 63 ccacctcccg cggaggaatc 20 64 20 DNA Artificial Sequence Antisense Oligonucleotide 64 ctctgggcca cggcagagag 20 65 20 DNA Artificial Sequence Antisense Oligonucleotide 65 cctccaggga gctatggacc 20 66 20 DNA Artificial Sequence Antisense Oligonucleotide 66 tctgccctga tgtgcgccct 20 67 20 DNA Artificial Sequence Antisense Oligonucleotide 67 actccccatt tggttcctgg 20 68 20 DNA Artificial Sequence Antisense Oligonucleotide 68 ccctgaggct tggtacctgc 20 69 20 DNA Artificial Sequence Antisense Oligonucleotide 69 ccggccaggg caccgcagct 20 70 20 DNA Artificial Sequence Antisense Oligonucleotide 70 gcagctgctg ctgcagccgt 20 71 20 DNA Artificial Sequence Antisense Oligonucleotide 71 tcacagtttt aatacttcct 20 72 20 DNA Artificial Sequence Antisense Oligonucleotide 72 aagcttcaca gttttaatac 20 73 20 DNA Artificial Sequence Antisense Oligonucleotide 73 tgagaaagct tcacagtttt 20 74 20 DNA Artificial Sequence Antisense Oligonucleotide 74 tgcactgaga aagcttcaca 20 75 20 DNA Artificial Sequence Antisense Oligonucleotide 75 caaagtgcac tgagaaagct 20 76 20 DNA Artificial Sequence Antisense Oligonucleotide 76 tcccacctcc ctaagtcatg 20 77 20 DNA Artificial Sequence Antisense Oligonucleotide 77 ggtgggtccc acctccctaa 20 78 20 DNA Artificial Sequence Antisense Oligonucleotide 78 ggtgtgtcac cttgatagcc 20 79 20 DNA Artificial Sequence Antisense Oligonucleotide 79 gctgctgcag ccgtatgaaa 20 80 20 DNA Artificial Sequence Antisense Oligonucleotide 80 ggagtgaaag cacccagctg 20 81 20 DNA Artificial Sequence Antisense Oligonucleotide 81 taggtcccac agcccctgag 20 82 20 DNA Artificial Sequence Antisense Oligonucleotide 82 ctcagaccct tgagttactg 20 83 20 DNA Artificial Sequence Antisense Oligonucleotide 83 actgcctctc gggttcaagc 20 84 20 DNA Artificial Sequence Antisense Oligonucleotide 84 gtcacaacac ctgcaggaga 20 85 20 DNA Artificial Sequence Antisense Oligonucleotide 85 ccaggtgagt gtgcaaggca 20 86 20 DNA Artificial Sequence Antisense Oligonucleotide 86 gcagtgcccg ctgcaagagc 20 87 20 DNA Artificial Sequence Antisense Oligonucleotide 87 atgctgcagc tcctcctggg 20 88 20 DNA Artificial Sequence Antisense Oligonucleotide 88 ggtccttgcg ctgggaggcc 20 89 20 DNA Artificial Sequence Antisense Oligonucleotide 89 cattcttgtt ctgggaaggt 20 90 20 DNA Artificial Sequence Antisense Oligonucleotide 90 tagctgggat tacagtacta 20 91 20 DNA Artificial Sequence Antisense Oligonucleotide 91 ctggacctac ctcgcacagg 20 92 20 DNA Artificial Sequence Antisense Oligonucleotide 92 gtcctgcccc acagactcag 20 93 20 DNA Artificial Sequence Antisense Oligonucleotide 93 gcatgaaggc actacctccc 20 94 20 DNA H. sapiens 94 cccgtcgagt gcatgaggtt 20 95 20 DNA H. sapiens 95 ttttcatacg gctgcagcag 20 96 20 DNA H. sapiens 96 gtcgagtgca tgaggttgac 20 97 20 DNA H. sapiens 97 acgctacttt gttgcacctg 20 98 20 DNA H. sapiens 98 gttgcacctg gagggaagaa 20 99 20 DNA H. sapiens 99 acctggaggg aagaacgtat 20 100 20 DNA H. sapiens 100 agaacgtatg ggagaggaag 20 101 20 DNA H. sapiens 101 ggccagtacc tccaggccct 20 102 20 DNA H. sapiens 102 cagctcttct gcaagaagga 20 103 20 DNA H. sapiens 103 cttctgcaag aaggactact 20 104 20 DNA H. sapiens 104 gcaagaagga ctactgggcc 20 105 20 DNA H. sapiens 105 aaggactact gggcccgcta 20 106 20 DNA H. sapiens 106 ctactgggcc cgctatggcg 20 107 20 DNA H. sapiens 107 ctggtggagc actccaagct 20 108 20 DNA H. sapiens 108 ggagcactcc aagctgtact 20 109 20 DNA H. sapiens 109 agctgtactg cgggcactgc 20 110 20 DNA H. sapiens 110 ggctgcatga gcccagatgt 20 111 20 DNA H. sapiens 111 gcccagatgt gaagaattcc 20 112 20 DNA H. sapiens 112 gatgtgaaga attccatcca 20 113 20 DNA H. sapiens 113 ggagaccgga tcttggaaat 20 114 20 DNA H. sapiens 114 ggatcttgga aatcaatggc 20 115 20 DNA H. sapiens 115 ttggaaatca atggcacgcc 20 116 20 DNA H. sapiens 116 ctgctccagc tgaccctcga 20 117 20 DNA H. sapiens 117 cagctctgcc cggcagaaac 20 118 20 DNA H. sapiens 118 ctgcccggca gaaacctgtc 20 119 20 DNA H. sapiens 119 gcctcccagc gcaaggacct 20 120 20 DNA H. sapiens 120 atcaaggtga cacaccgtga 20 121 20 DNA H. sapiens 121 aggtgaaggt catgcgatgc 20 122 20 DNA H. sapiens 122 aaggtcatgc gatgcctgga 20 123 20 DNA H. sapiens 123 catcatcaag agcatggaca 20 124 20 DNA H. sapiens 124 tcaagagcat ggacagccag 20 125 20 DNA H. sapiens 125 gcatcaggga tggcctacct 20 126 20 DNA H. sapiens 126 tgaacatcat ccaccgagac 20 127 20 DNA H. sapiens 127 ggtccgcgag aacaagaatg 20 128 20 DNA H. sapiens 128 ccgagaagag gccatccttt 20 129 20 DNA H. sapiens 129 cccgactgag ccagggccac 20 130 20 DNA H. sapiens 130 gattcctccg cgggaggtgg 20 131 20 DNA H. sapiens 131 ctctctgccg tggcccagag 20 132 20 DNA H. sapiens 132 ggtccatagc tccctggagg 20 133 20 DNA H. sapiens 133 ccaggaacca aatggggagt 20 134 20 DNA H. sapiens 134 agctgcggtg ccctggccgg 20 135 20 DNA H. sapiens 135 acggctgcag cagcagctgc 20 136 20 DNA H. sapiens 136 gtattaaaac tgtgaagctt 20 137 20 DNA H. sapiens 137 tgtgaagctt tctcagtgca 20 138 20 DNA H. sapiens 138 tttcatacgg ctgcagcagc 20 139 20 DNA H. sapiens 139 cagctgggtg ctttcactcc 20 140 20 DNA H. sapiens 140 cagtaactca agggtctgag 20 141 20 DNA H. sapiens 141 gcttgaaccc gagaggcagt 20 142 20 DNA H. sapiens 142 tctcctgcag gtgttgtgac 20 143 20 DNA H. sapiens 143 tgccttgcac actcacctgg 20 144 20 DNA H. sapiens 144 gctcttgcag cgggcactgc 20 145 20 DNA H. sapiens 145 cccaggagga gctgcagcat 20 146 20 DNA H. sapiens 146 ggcctcccag cgcaaggacc 20 147 20 DNA Artificial Sequence Antisense Oligonucleotide 147 ttctacctcg cgcgatttac 20 148 20 DNA Artificial Sequence Antisense Oligonucleotide 148 tacgtccgga ggcgtacgcc 20 

What is claimed is:
 1. A compound 8 to 80 nucleobases in length targeted to a nucleic acid molecule encoding LIM domain kinase 1, wherein said compound specifically hybridizes with said nucleic acid molecule encoding LIM domain kinase 1 and inhibits the expression of LIM domain kinase
 1. 2. The compound of claim 1 which is an antisense oligonucleotide.
 3. The compound of claim 2 wherein the antisense oligonucleotide comprises at least one modified internucleoside linkage.
 4. The compound of claim 3 wherein the modified internucleoside linkage is a phosphorothioate linkage.
 5. The compound of claim 2 wherein the antisense oligonucleotide comprises at least one modified sugar moiety.
 6. The compound of claim 5 wherein the modified sugar moiety is a 2′-O-methoxyethyl sugar moiety.
 7. The compound of claim 2 wherein the antisense oligonucleotide comprises at least one modified nucleobase.
 8. The compound of claim 7 wherein the modified nucleobase is a 5-methylcytosine.
 9. The compound of claim 2 wherein the antisense oligonucleotide is a chimeric oligonucleotide.
 10. A compound 8 to 80 nucleobases in length which specifically hybridizes with at least an 8-nucleobase portion of a preferred target region on a nucleic acid molecule encoding LIM domain kinase
 1. 11. A composition comprising the compound of claim 1 and a pharmaceutically acceptable carrier or diluent.
 12. The composition of claim 11 further comprising a colloidal dispersion system.
 13. The composition of claim 11 wherein the compound is an antisense oligonucleotide.
 14. A method of inhibiting the expression of LIM domain kinase 1 in cells or tissues comprising contacting said cells or tissues with the compound of claim 1 so that expression of LIM domain kinase 1 is inhibited.
 15. A method of treating an animal having a disease or condition associated with LIM domain kinase 1 comprising administering to said animal a therapeutically or prophylactically effective amount of the compound of claim 1 so that expression of LIM domain kinase 1 is inhibited.
 16. A method of screening for an antisense compound, the method comprising the steps of: a. contacting a preferred target region of a nucleic acid molecule encoding LIM domain kinase 1 with one or more candidate antisense compounds, said candidate antisense compounds comprising at least an 8-nucleobase portion which is complementary to said preferred target region, and b. selecting for one or more candidate antisense compounds which inhibit the expression of a nucleic acid molecule encoding LIM domain kinase
 1. 17. The method of claim 15 wherein the disease or condition is a developmental disorder.
 18. The method of claim 15 wherein the disease or condition is a neurological disorder.
 19. The compound of claim 1 targeted to a nucleic acid molecule encoding LIM domain kinase 1, wherein said compound specifically hybridizes with and differentially inhibits the expression of one of the variants of LIM domain kinase 1 relative to the remaining variants of LIM domain kinase
 1. 20. The compound of claim 19 targeted to a nucleic acid molecule encoding LIM domain kinase 1, wherein said compound hybridizes with and specifically inhibits the expression of a variant of LIM domain kinase 1, wherein said variant is selected from the group consisting of LIM domain kinase 1, LIMK hypothetical-1 and dLIMK. 